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CROSS-REFERENCES TO RELATED APPLICATIONS 
5 This application claims the benefit of U.S. Provisional Application 

No.60/358,986 filed February 21, 2002, which is incorporated herein by reference. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT 

This invention was made with government support under grant no. UOl 
1 0 CA85067 awarded by the National Cancer Institute. The government may have certain 
rights in this invention. 

BACKGROUND OF THE INVENTION 
Prostate cancer is the most common form of cancer in males. It typically 
afflicts aging males, but it can afflict males of all ages. A significant number of males die 
1 5 fiom prostate cancer every year, and it is the second leading cause of cancer deaths in 
men. Early diagnosis of prostate cancer in patients reduces the likelihood of death. 

The effectiveness of any diagnostic test depends upon its specificity and 
selectivity. That is, what is the relative ratio of true positive diagnoses, true negative 
diagnoses, false positive diagnoses and false negative diagnoses? Methods of increasing 
20 the percentage of trae positive and true negative diagnoses for any condition are desirable 
medical goals. In the case of prostate canc^, the preset diagnostic tests are not 
completely satisfactory, in that they provide significant numbers of false positive and 
false negative results. 

Conventionally, prostate cancer is diagnosed using prostate specific 
25 antigen (PSA) as a marker. In general, PSA levels above 4 ng/ml are suggestive of 
prostate cancer while levels above 10 ng/ml are highly suggestive of prostate cancer. 
However, if the cancer is in its early stages, some prostate cancer patients exhibit normal 
PSA levels at the tune of diagnosis. Since conventional PSA tests detect abnormal levels 
of PSA, conventional PSA tests may not be able to detect the presence of prostate cancer 
30 ifit is in its early stages. This results in a false negative diagnosis. The inability of 
conventional PSA tests to diagnose the presence of prostate cancer in some instances 

in the early stages of the disease) can be detrimental to the patient Moreover, many 
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individuals with elevated levels of PSA in the blood senim may not have prostate cancer, 
but may instead have benign prostate hyperplasia (BPH) (/.e., a benign tumor) or prostate 
intraepithelial neoplasia (PIN). This results in a false positive diagnosis. In order to 
determine if a person has prostate cancer, rather than BPH or PIN, additional 

5 immunoassays using other antibodies and/or biopsies of the prostate tissue are performed. 
These additional tests are time consuming and expensive for both patients and their care 
providers- 
There is some consensus in the medical community that better diagnosis 
will result from the discovery of more disease markers that can be used alone or in 

10 combination to increase the specificity and selectivity of diagnostic tests. 

SUMMARY OF THE INVENTION 

Several organic biomolecules (e-g., proteins, glycoproteins, nucleoproteins 
and the like) have been discovered that possess molecular weights capable of functioning 
as markers in prostate cancer C*PCA")> prostate intraepithelial neoplasia ('TIN"), or 

15 benign prostate hyperplasia ("BPH") versus a negative diagnosis. Compared to a 

negative diagnosis, the markers are, variously, more frequently detected, less frequently 
detected, or differentially detected. The detection and amount (either absolute or relative) 
of these markers, alone or in combination, in patient samples provides information that 
the diagnostician can correlate with probable diagnoses of prostate cancer, benign 

20 prostate hyperplasia or with a negative diagnosis (e.g., normal or disease-free). Each 
mark» can be distinguished from other biomolecular masses as the source from which 
the markers are derived is controlled and/or particular purification procedures are 
employed that minimize the potential of false positives, as described herein. As is also 
described herein, there are several methods that allow resolution and detection of the 

25 marker molecular weights. In preferred embodiments, the method of resolution involves 
Surface-Enhanced Laser Desorption/Ionization mass spectrometry, in which the surface 
of the mass spectrometry probe plays an active role in the desorption and ionization of the 
analyte. 

A set of markers was identified in prostate epithelial cell lysates derived 
30 from laser capture microdissection (LCM). The Marker Set includes the following: 
Marker EPl : 3448 i 19 Da, Marker EP2: 4036 ± 22 Da, Marker EP3: 4361 ± 24 Da, 
Marker EP4: 4413 ± 24 Da, Marker EPS: 4639 db 26 Da, Marker EP6: 4749 db 26 Da, 
Marker EP7: 4827=b 27 Da, Marker EPS: 5666 ± 31 Da, Marker EP9: 8445 ± 46 Da, 
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Marker EPIO: 1 1744 ± 65 Da, Marker EPl 1: 14696 ± 81 Da, Marker EP12: 24184 ± 133 
Da, Marker EP13: 48308 ± 266 Da, and Marker EP14: 53830 ± 296 Da. These markers 
are further characterized by the ability to bind to a metal chelate adsorbent and washed 
with an eluant at neutral pH. This implies that they contain metal binding residues. 
5 Markers EP2, EP3, EPS, EP6 and EP12 occur in a greater frequency and abundance in the 
PIN and PCA cell lysates. Markers EP4, EPS, EP13, and EP 14 appear in greater 
abundance in normal and BPH cell lysates. Marker EPIO had a decreased abundance in 
BPH, PIN, and PCA cell lysates. Markers EPS and EP12 are found only in PCA cell 
lysates. 

10 The in addition to mass characteristics, the markers may also be 

characterized by affinity characteristics. For example, the molecular weight and metal 
binding characteristics are provided herein. We point out that molecular weight and 
metal binding are characteristic properties of these markers and not limitations on means 
of detection or isolation. Furthermore, once partial amino acid sequmces are obtained, 

1 S each marker can be identified unambiguously by matching the sequence in a protein 
database. 

One aspect of the invention is a method of qualifying a prostate cancer 
status in a subject comprising: a) measuring at least one marker in a sample from the 
subject, wherein the marker is selected Scorn the group consisting of those markers noted 

20 above as being identified in prostate epithelial cell lysates and combinations thereof; and, 
b) correlating the measurement with prostate cancer status. 

In one embodiment of this method, the prostate cancer status is selected 
from the group consisting of prostate cancer (PCA), prostate intraepithelial neoplasia 
(PIN), and benign prostate hyperplasia (BPH). In another embodiment the sample is 

25 prostate tissue extract. Still another embodiment is where measuring comprises 

determining the mass of the protein, which an be by mass spectrometry, preferably is gas 
phase ion spectrometry, moro preferably laser desorption ionization mass spectrometry. 

Samples used in the method may be selected from the group consisting of 
blood, serum, urine, prostatic fluid, seminal fluid, semen, and prostate tissue. 

30 Measuring marker amount may fiirther comprise i) firactionating the 

sample; ii) binding a fixiction of the sample to an adsorbent; and, iii) comprises detecting 
the protein marker by gas phase ion spectrometry. Alternatively, marker amount may be 
detected by immunoassay. 
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The absorbent used in the methods may be selected from the group 
consisting of a hydrophilic adsorbent, a metal chelate adsorbent, and a strong anion 
exchange adsorbent. 

In another embodiment of the method, measuring the biomarker(s) 

5 comprises i) embedding a portion of a tissue specimen harvested from a patient in OCT 
and freezing the specimen; ii) obtaining cryosections from the tissue specimen; iii) 
obtaining cell samples from the cryosections by laser capture microdissection; iv) mixing 
cell samples from step (d) with lysis buffer thereby producing cell lysates; v) diluting and 
vortexing the cell lysates; vi) centrifuging the vortexed cell lysates thereby producing a 

10 supernatant fraction; vii) binding the supematant fraction to an adsorbent; and, viii) 
comprising detecting the protein marker using gas phase ion spectrometry. The 
adsorbent used may be selected from the group consisting of a hydrophilic adsorbent, a 
metal chelate adsorbent, and a strong anion exchange adsorbent. The sample may 
comprise any number of marker combinations, for example simply marker EPS, markers 

15 EP2 and EPS, EP2 and EPS, markers EP3 and EPS, markers EP2 and EP6, markers EP3 
and EP6, markers EPS and EP6, markers EP2, EPS and EPS, markers EP2, EPS and EP6, 
or markers EP2, EPS, EP4, EPS, EP6, and EPS. 

In one embodiment, correlating the measurement with prostate cancer 
status includes: i) gen^ting data for each protein marker with the mass spectrometer, the 

20 data comprising a mass/charge ratio and an amount determination for each ion 

corresponding to each protein marker; ii) transforming the data into computer-readable 
form; and, iii) executing an algorithm with a programmable digital computer, wherein the 
algorithm determines closeness-of-fit between the computer-readable data and a data set 
indicating a diagnosis of PCA, PIN, BPH or a negative diagnosis. 

25 In another aspect this invention provides a method for aiding in a diagnosis 

of prostate cancer, prostate intraepithelial neoplasia, or benign prostate hyperplasia. The 
method involves: a) detecting at least one protein marker selected from the Marker Set (as 
defined herein) in a sample from a subject; and b) correlating the detection of the marker 
or markers with a probable diagnosis of prostate cancer, prostate intraepithelial neoplasia, 

SO benign prostate hyperplasia or a negative diagnosis, wherein the correlation takes into 
account the relative detectability of the marker or markers in each diagnosis. 

In certain embodiments, the sample may be selected from seminal plasma, 
blood, serum, urine, prositatic fluid, seminal fluid, semen, and prostate tissue. In the 
preferred embodiment, the marker or markers is detected in ceU lysates from prostate 
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tissue. In other embodiments, the marker or markers are detected by gas phase ion 
spectrometry or, more particularly, laser desorption mass spectrometry. In another 
embodiment, the marker or markers are detected by immunoassay. In other 
embodiments, the method comprises detecting a plurality of the markers. In other 
5 embodiments, the method involves detecting at least marker EP2 and a second marker 
selected from EPS, EPS, or EP6. In another embodiment, the method comprises detecting 
at least marker EPS and a second marker selected from EPS or EP6. In another 
embodiment, the method comprises detecting at least marker EPS and EP6. In another 
embodiment, the method comprises detecting at least markers EP2, EPS and EPS. In 
10 another embodiment, the method comprises detecting at least markers EP2, EPS and EP6. 
In another embodiment the method comprises: i) generating data on the sample with the 
mass spectrometer indicating intensity of signal for mass/charge ratio, ii) transforming the 
data into computer-readable form; and iii) executing an algorithm with a progranmaable 
digital computer, wherein the algorithm detects signal in the computer-readable data and 
1 S wherein detecting the markers is indicative of a diagnosis of PCA, PIN, BPH or a 
negative diagnosis 

In another aspect this invention provides a metiiod for detecting at least 
one protein marker from the Marker Set in a sample, wherein the method comprises 
detecting the marker or markers by gas phase ion spectrometry. 
20 In one embodiment the method comprises: i) generating data on the 

sample with a mass spectrometer indicating intensity of signal for mass/charge ratio, ii) 
transforming the data into computer-readable form; and iii) executing an algoritiim with a 
programmable digital computer wherein the algorithm detects signal in the computer- 
readable data representing the marker or markers and wherein detecting the markers is 
2S indicative of a diagnosis of PCA, PIN, BPH or a negative diagnosis. 

In another aspect of this invention, a marker or multiple markers are 
detected in samples and an artificial intelligence program such as fuzzy logic, cluster 
analysis, or neural network are used to analyze the data. 

In another embodiment the method further comprises i) before detecting 
SO the marker or markers, fractionating a sample comprising the marker or markers by 

contacting the sample with a substrate comprising an adsorbent that retains the marker or 
markers and removing unretained sample; and ii) desorbing and ionizing the retained 
markers fix)m the adsorbent during mass spectrometry. In certain embodiments of this 
method the substrate is a mass spectrometer probe comprising the adsorbent on a probe 
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surface. In another embodiment, the substrate is a resin, and, after fractionating the 
sample, the resin with the marker or markers retained by the adsorbent is placed on a 
mass spectrometer probe for desorption and ionization by the mass spectrometer. In 
another embodiment the adsorbent is selected from a hydrophilic adsorbent, eg., an 

5 anionic adsorbent, and a metal chelate adsorbent, eg., a copper chelate adsorbent. 

In one embodiment involving laser desorption/ionization mass 
spectrometry the method comprises: i) providing a probe adapted for use with a mass 
spectrometer comprising an adsorbent attached thereto; ii) contacting the marker or 
markers with the adsorbent; iii) desorbing and ionizing the marker or markers from the 

1 0 probe; and, iv) detecting the desorbed/ionized marker or markers with the mass 

spectrometer. In another embodiment the method comprises: i) providing a substrate 
comprising an adsorbent attached thereto; ii) contactmg the marker or markers with the 
adsorbent; iii) placing the substrate on a probe adapted for use with a mass spectrometer 
comprising an adsorbent attached thereto; iv) desorbing and ionizing the marker or 

15 markers from the probe; and, v) detecting the desorbed/ionized marker or markers with 
the mass spectrometer. In other embodiments the adsorbent is a hydrophilic adsorbent 
(eg., silicon oxide) or a metal chelate adsorbent (eg., copper chelate). In another 
embodiment, the adsoibent comprises an antibody that specifically binds to the maiker. 

In another aspect the invention provides a purified protein selected fix>m 

20 the protein markers of the Marker Set. In one embodiment, the purified protein is 
produced by a process comprising fiactionating a sample comprising the marker or 
markers by size exclusion chromatography and/or anion exchange chromatography, then 
collecting a fraction that includes the maricer or markers. 

In another aspect this invention provides a kit comprising: (1) an adsorbent 

25 attached to a substrate, wherein the adsorbent retains a protein marker selected from the 
Marker Set; and (2) instructions to detect the marker or markers by contacting a sample 
with the adsorbent and detecting the marker or markers retained by the adsorbent. In one 
embodiment of this invention the substrate is a probe for a gas phase ion spectrometer 
having a surface on which the adsorbent is attached. The adsorbent can be, eg. , a metal 

30 chelating adsorbent (eg. , IMAC3 Cull chip). In another embodiment the kit fiirther 
comprises (1) an eluant wherein the marker or markers are retained on the adsorbent 
when washed with the eluent, or (2) instructions for washing the adsorbent with the eluent 
after contacting the adsorbent with the marker or markers. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1: Representative SELDI spectra of one LCM sample from PCA 
epithelial cells in m/z range of (a) 3000-10,000, (b) 10,000 to 20,000 and (c) 20,000 to 
80,000. An average of 70 peaks were detected for each cell lysate analyzed. 
5 Figure 2a: Representative spectra and gel-view of matched prostate 

epithelial cell lysates showing protein alterations in the mass/charge(ni/z) range of 4000- 
6000. (arrows indicate over-expressed peaks in PIN and PCA.) 

Figure 2b: SELDI gelview protein profiles of lysates prepared from cells 
procured from different prostatectomy specimens. The red box identifies a peak with an 
10 average mass of 4749 Da that appears to be more abundant in PIN and PCA epithelial 
cells. Replicate samples(*) show good reproducibility of the protein pattems. The 
different protein pattems observed for two different PCA foci in Patient #2 and the two 
PIN foci in Patient #3, may be the result of genetic heterogeneity 

Figure 2c: Representative spectra and gel-views of matched prostate 
1 5 epithelial cell lysates showing increased abundance of a peak at m/z of 5666 in the BPH 
sample. 

Figure 3: Example of protein peaks found between 20,000-70,000 Da. 
Differential expression of a peak at 28400 Da, thought to be PSA was observed in 
epithelial cell lysates, but absent in lysates fcom adjacent stromal cells. The intracellular 
20 free PSA appears to be under-expressed in the PIN and PCA epithelial cell lysates. The 
serum albumin peak at 66800 Da was present in each sample. 

Figure 4: Expression levels of peptide/proteins compared to the expression 
levels in matched normal epithelial cells. The average expression m normal cell lysates 
was subtracted from the expression levels found in the other cell types. Upward bars 
25 indicate over-expression, downward bars indicate under-expression. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

I. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herem 
30 have the meaning commonly understood by a person skilled in the art to which this 

invention belongs. The following references provide one of skill wifli a gen^ definition 
of many of the terais used in this invention: Singleton et al. Dictionary of Microbiology 
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and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and 
Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et aL (eds.). 
Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology 
(1991). As used herein, the following terms have the meanings ascribed to them unless 
5 specified otherwise. 

'•Marker*' in the context of the present invention refers to an organic 
biomolecule, particularly a polypeptide, which is differentially present in a sample taken 
fix)m patients having prostate cancer, prostate intraepithelial neoplasia, or benign prostate 
hyperplasia as compared to a comparable sample taken from subjects who do not have 

10 prostate cancer {e.g, negative diagnosis, normal or healthy subject). For example, a 
marker can be a polypeptide (having a particular apparent molecular weight) which is 
present at an elevated or decreased level in samples of prostate cancer patients compared 
to samples of patients with a negative diagnosis. 

"Organic biomolecule" refers to an organic molecule of biological origin, 

15 e.g.^ steroids, amino acids, nucleotides, sugars, polypeptides, polynucleotides, complex 
carbohydrates or lipids. 

The phrase ''diff^entially present" refers to difTerences in the quantity of a 
polypeptide (of a particular qiparent molecular weight) present in a sample taken from 
patients having prostate cancer as compared to a comparable sample taken from patients 

20 who do not have prostate canicer {e.g. , have braign prostate hyperplasia). A polypeptide 
is differentially present between the two samples if the amount of the polypeptide in one 
sample is significantly different from the amount of the polypeptide in the other sample. 
For example, a polypeptide is differentially present between the two samples if it is 
present in an amount (e.g., concentration, mass, molar amount, etc.) at least about 150%, 

25 at least about 200%, at least about 500% or at least about 1000% greater than it is present 
in the other sample, or if it is detectable in one sample and not detectable in the other. 
Any polypeptides that are differentially present in samples taken from prostate cancer 
patients as compared to subjects who do not have prostate cancer (e.g., benign prostate 
hyperplasia patients) can be used as markers. 

30 "Diagnostic" means idmtifying the presence or nature of a pathologic 

condition. Diagnostic methods differ in their sensitivity and ^ecificity. ''Sensitivity" is 
defined as the percent of diseased (Benign Prostate Hyperplasia (BPH)/Prostate 
Intraepithelial Neoplasia (PIN)/Prostate Adenocaroinoma (PCA)) cell types for which the 
biomaiker of interest is present, (true positive/total number of diseased lysates x lOO). 
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"Specificity** is defined as the percent of cell types for which the biomarker of interest is 
not positive, (true negative/total number of lysates without disease x 100). Diseased 
individuals not detected by the assay are "false negatives." Subjects who are not diseased 
and who test negative in the assay, are termed **true negatives." While a particular 
5 diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the 
method provides a positive indication that aids in diagnosis. 

A **test amount" of a marker refers to an amount of a marker present in a 
sample being tested. A test amount can be either in absolute amount (e.g., ^g/ml) or a 
relative amount (e.g., relative intensity of signals). 
10 A "diagnostic amount" of a marker refers to an amoimt of a marker in a 

subject's sample that is consistent with a diagnosis of prostate cancer. A diagnostic 
amount can be either in absolute amount (e.g., ^g/ml) or a relative amount ie.g., relative 
intensity of signals). 

A "control amount" of a marker can be any amount or a range of amount 
1 5 which is to be compared against a test amount of a marker. For example, a control 
amount of a marker can be the amount of a marker (e.g., seminal basic proteui) in a 
prostate cancer patient, a BPH patient or a person without prostate cancer or BPH. A 
control amount can be either in absolute amount (e.gy jig/ml) or a relative amount (eg., 
relative intensity of signals). . 
20 "Probe" refers to a device that, when positionally engaged in an 

interrogatable relationship to an ionization source, e.g., a laser desorption/ionization 
source, and in concurrent conmtnmication at atmospheric or sub-atmospheric pressure 
with a detector of a gas phase ion spectrometer, can be used to introduce ions derived 
from an analyte into the spectrometer. As used herein, the **probe" is typically reversibly 
25 engageable (e.g., removably insertable) with a probe interface that positions the probe in 
an interrogatable relationship witii the ionization source and in communication with the 
detector . A probe will generally comprise a substrate comprising a sample presenting 
surface on which an analyte is presented to the ionization source. 

"Substrate" or ••probe substrate" refers to a solid phase onto which an 
30 adsorbent can be provided ie.g. , by attachment, deposition, etc.) 

"Ionization source" refers to a device that directs ionizing energy to a 
sample presenting surface of a probe to desorb and ionize analytes Srom the probe surface 
into the gas phase. The preferred ionization source is a laser (used in laser 
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desorption/ionization), in particular, nitrogen lasers, Nd-Yag lasers and other pulsed laser 
sources. Other ionization sources include fast atoms (used in fast atom bombardment), 
plasma energy (used in plasma desorption) and primary ions generating secondary ions 
(used in secondary ion mass spectrometry). 

"Gas phase ion spectrometer^* refers to an apparatus that detects gas phase 
ions. In the context of this invention, gas phase ion spectrometers include an ionization 
source used to generate the gas phase ions. Gas phase ion spectrometers include, for 
example, mass spectrometers, ion mobility spectrometers, and total ion current measuring 
devices. 

"Gas phase ion spectrometry^' refers to a method comprising employing an 
ionization source to generate gas phase ions from an analyte presented on a sample 
presenting surface of a probe and detecting the gas phase ions with a gas phase ion 
spectrometer. 

•*Mass spectrometer^* refers to a gas phase ion spectrometer that measures 
a parameter which can be translated into mass-to-charge ratios of gas phase ions. Mass 
spectrometers generally include an inlet system, an ionization source, an ion optic 
assembly, a mass analyzer, and a detector. Examples of mass spectrometers are time-of- 
flight, magnetic sector, quadrapole filter, ion trap, ion cyclotron resonance, electrostatic 
sector analyzer and hybrids of these. 

•*Mass spectrometry** refers to a method comprising employing an 
ionization source to generate gas phase ions from an analyte presented on a sample 
presenting surface of a probe and detecting the gas phase ions with a mass spectrometer. 

**Laser desorption mass spectrometer'* refers to a mass spectrometer which 
uses laser as a means to desorb, volatilize and ionize an analyte. 

"Adsorbent** refers to any material capable of adsorbing a marker. The 
term "adsorbent** is used herein to refer both to a single material C*monoplex adsorbent**) 
a compound or functional group) to which the marker is exposed, and to a plurality 
of different materials C*multiplex adsorbent**) to which the marker is exposed. The 
adsorbent materials in a multiplex adsorbent are referred to as "adsorbent species.** For 
example, an addressable location on a probe substrate can comprise a multiplex adsorbent 
charactoized by many different adsori>CTt species (e.^., anion exchange materials, metal 
chelators, or antibodies), having different binding characteristics. Substrate material itself 
can also contribute to adsorbing a marker and may be considered part of an "adsorbent.** 
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"Adsorption" or **rctention'* refers to the detectable binding between an 
absorbent and a marker either before or after washing with an eluant (selectivity threshold 
modifier) or a washing solution. 

"Eluant" or '^washing solution" refers to an agent that can be used to 
5 mediate adsorption of a marker to an adsorbent Eluants and washing solutions also are 
referred to as "selectivity threshold modifiers." Eluants and washing solutions can be 
used to wash and remove unbound materials firom the probe substrate surface. 

"Resolve," "resolution," or "resolution of marker" refers to the detection 
of at least one marker in a sample. Resolution includes the detection of a plurality of 
10 markers in a sample by separation and subsequent differential detection. Resolution does 
not require the complete separation of a marker fi-om all other markers in a mixture. 
Rather, any separation that allows the distinction between at least two markers suffices. 

"Detect" refers to identifying the presence^ absence or amount of the 
object to be detected. 

15 The terms •^polypeptide," '"pqjtide" and ••protein" are used interchangeably 

herein to refer to a polymer of amino acid residues. The terms apply to amino acid 
polymers in which one or more amino acid residue is an analog or mimetic of a 
corresponding naturally occurring amino acid, as well as to naturally occurring amino 
acid polymers. Polypeptides can be modified, e.g^., by the addition of carbohydrate 

20 residues to form glycoproteins. The terms ••polypeptide," ••peptide" and '^protein" include 
glycoproteins, as well as non-glycoproteins. 

••Detectable moiety" or a ••label" refers to a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For 
example, useful labels include ^^P, ^^S, fluorescent dyes, electron-dense reagents, 

25 enzymes {e,g, , as commonly used in an ELIS A), biotin-streptavidin, digoxigenin, haptens 
and proteins for which antisera or monoclonal antibodies are available, or nucleic acid 
molecules with a sequence complementary to a target. The detectable moiety often 
generates a measurable signal, such as a radioactive, chromogenic, or fluorescent signal, 
that can be used to quantify the amount of bound detectable moiety in a sample. The 

30 detectable moiety can be incorporated in or attached to a primer or probe either 

covalently, or through ionic, van der Waals or hydrogen bonds, eg., incorporation of 
radioactive nucleotides, or biotinylated nucleotides that are recognized by streptavidin. 
The detectable moiety may be directly or indirectly detectable. Indirect detection can 
involve the binding of a second directly or indirectly detectable moiety to flie detectable 
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moiety. For example, the detectable moiety can be the ligand of a binding partner, such 
as biotin, which is a binding partner for streptavidin, or a nucleotide sequence, which is 
the binding partner for a complementary sequence, to which it can specifically hybridize. 
The binding partner may itself be directly detectable, for example, an antibody may be 
5 itself labeled with a fluorescent molecule. The binding partner also may be indirectly 
detectable, for example, a nucleic acid having a complementary nucleotide sequence can 
be a part of a branched DNA molecule that is in turn detectable through hybridization 
with other labeled nucleic acid molecules. {See^ e.g., P. D. Fahrlander and A. Klausner, 
Bio/Technology 6:1 165 (1988)). Quantitation of the signal is achieved by, e.g., 

1 0 scintillation counting, densitometry, or flow cytometry. 

"Measure" in all of its granmiatical forms, refers to detecting, quantifying 
or qualifying the amount (including molar amount), concentration or mass of a physical 
entity or chemical composition either in absolute terms in the case of quantifying, or in 
terms relative to a comparable physical entity or chemical composition, 

1 5 "Antibod/* refers to a polypeptide ligand substantially encoded by an 

immunoglobulin gene or immunoglobulin genes, or fragments thereof, which specifically 
binds and recognizes an epitope (eg., an antigen). The recognized immunoglobulin 
genes include the kappa and lambda light chain constant region genes, the alpha, gamma, 
delta, epsilon and mu heavy chain constant region genes, and the myriad immunoglobulin 

20 variable region genes. Antibodies exist, e.g., as intact immunoglobulins or as a number 
of well characterized firagments produced by digestion with various peptidases. This 
includes, e.g.. Fab* and F(ab)*2 firagments. The term "antibody," as used herein, also 
includes antibody firagments either produced by the modification of whole antibodies or 
those synthesized de novo using recombinant DNA methodologies. It also includes 

25 polyclonal antibodies, monoclonal antibodies, chimeric antibodies, humanized antibodies, 
or single chain antibodies. •Tc'* portion of an antibody refers to that portion of an 
immmoglobulin heavy chain that comprises one or more heavy chain constant region 
domams, CHI, CH2 and CH3, but does not include the heavy chain variable region. 

"Immimoassa/' is an assay that uses an antibody to specifically bind an 

30 antigen. The immunoassay is characterized by the use of specific binding properties of a 
particular antibody to isolate, target, and/or quantify the antigen. 

The phrase "specifically (or selectively) binds" to an antibody or 
"specifically (or selectively) immunoreactive with," when referring to a protein or 
peptide, refers to a binding reaction that is determinative of the presence of the protein in 
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a heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bind to a particular protein at least two 
times the background and do not substantially bind in a significant amount to other 
proteins present in the sample. Specific binding to an antibody imder such conditions 
may require an antibody that is selected for its specificity for a particular protein. For 
example, polyclonal antibodies raised to seminal basic protein fi-om specific species such 
as rat, mouse, or human can be selected to obtain only those polyclonal antibodies that are 
specifically immunoreactive with seminal basic protein and not with other proteins, 
except for polymorphic variants and alleles of seminal basic protein. This selection may 
be achieved by subtracting out antibodies that cross-react with seminal basic protein 
molecules fi-om other species. A variety of immunoassay formats may be used to select 
antibodies specifically immunoreactive with a particular protein. For example, solid- 
phase ELISA immunoassays are routinely used to select antibodies specifically 
immunoreactive with a protein {see, e.g., Harlow & Lane, Antibodies, A Laboratory 
Manual (1988), for a description of inununoassay formats and conditions that can be used 
to determine specific immunoreactivity). Typically a specific or selective reaction will be 
at least twice background signal or noise and more typically more than 10 to 100 times 
background. 

II. GAS PHASE ION SPECTROMETRY 

In one aspect, the invention provides methods for detecting markers which 
are differentially present in samples &om patients with prostate cancer, prostate 
intraepithelial neoplasia, benign prostate hyperplasia and with negative diagnoses. Any 
one or combination of markers described are within the scope of this aspect of this 
invention and can be detected. The methods for detecting these markers have many 
applications. For example, one marker or combination of markers can be measured to 
differentiate between PCA or PIN and BPH, and thus are usefiil as an aid in the diagnosis 
of prostate cancer in a patient. In another example, the present methods for detecting 
these markers can be applied to in vitro prostate cancer cells or in vivo animal models for 
prostate cancer to assay for and identify compoxmds that modulate expression of these 
markers. In one embodiment of the detection method, the marker or markers are detected 
by gas phase ion spectrometry. In a preferred embodiment, the marker or markers are 
detected by mass spectrometry and, in particular, laser desoiption mass spectrometry. 
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Probes suitable for use in the invention are described in, e.g., U.S. Patent 
Nos, 5,617,060 (Hutchens and Yip) and 6,225,047 (Hutchens and Yip). In one 
embodiment, a substrate comprising an adsorbent can be in the form of a probe, which is 
removably insertable into a gas phase ion spectrometer. For example, a substrate can be 

5 in the form of a strip with adsorbents on its siuface. In another embodiment, a substrate 
comprising an adsorbent can be positioned onto another substrate to form a probe, which 
is removably insertable into a gas phase ion spectrometer. For example, a substrate 
comprising an adsorbent can be a solid phase, such as a polymeric or glass bead with a 
functional group for binding a marker, which can be subsequently positioned on a second 

10 substrate to form a probe. For example, the second substrate can be in the form of a strip, 
or a plate having a series of wells at a predetermined addressable locations. One 
advantage of this embodiment is that the marker can be adsorbed to the first substrate in 
one physical context, and transferred to the second substrate, which can then be submitted 
for analysis by gas phase ion spectrometry. The probe can be in any shape as long as it is 

1 5 removably insertable into a gas phase ion spectrometer. 

The probe can also be adapted for use with inlet systems and detectors of a 
gas phase ion spectrometer. For example, the probe can be adapted for mounting in a 
horizontally, vertically and/or rotationally translatable carriage that moves the probe to a 
successive position without requiring repositioning of the probe by hand. 

20 The probe substrate is preferably made of a material that is cs^able of 

supporting adsorbents. For example, the probe substrate material can include, but is not 
limited to, insulating materials (e.g., glass, ceramic), semi-insulating materials (e.g., 
silicon wafers), or electrically conducting materials (e.g., metals, such as copper, brass, 
steel, aliuninimi, gold, or electrically conductive polymers), organic polymers, 

25 biopolymers, or any combinations thereof. 

The probe substrate surface can be conditioned to bind markers. For 
example, in one embodiment, the surface of the probe substrate can be conditioned (e.g., 
chemically or mechanically such as roughening) to place adsorbents on the surface. The 
adsorbent comprises functional groups for binding with a marker. In some embodiments, 

30 the substrate material itself can also contribute to adsorbent properties and may be 
considered part of an ^'adsorbent.** 

Any number of different adsorbents can be used as long as they have 
binding characteristics suitable for binding the markers of the present invention. The 
adsorbents can comprise a hydrophobic group, a hydrophilic group, a cationic group, an 

14 
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anionic group, a metal ion chelating group, or antibodies which specifically bind to 
antigens, or a combination thereof (sometimes referred to as "a mixed mode" adsorbent). 
Exemplary adsorbents comprising a hydrophobic group include matrices having aliphatic 
hydrocarbons, Ci-Cis aliphatic hydrocarbons and matrices having aromatic 

5 hydrocarbon functional group such as phenyl groups. Exemplary adsorbents comprising 
a hydrophilic group include silicon oxide glass), or hydrophilic polymers such as 
polyethylene glycol, dextran, agarose, or cellulose. Exemplary adsorbents comprising a 
cationic group include matrices of secondary, tertiary or quaternary amines. Exemplary 
adsorbents comprising an anionic group include matrices of sulfate anions (SO3*) and 

10 matrices of carboxylate anions (i.e., COO ) or phosphate anions (OPOs")- Exemplary 

adsorbents comprising metal chelating groups include organic molecules that have one or 
more electron donor groups which form coordinate covalent bonds with metal ions, such 
as copper, nickel, cobalt, zinc, iron, and other metal ions such as aluminum and calcium. 
Exemplary adsorbents comprising an antibody include antibodies that are specific for any 

1 S one of the markers provided herein. In preferred embodiments, adsorbents are 

substantially similar to or the same as the adsorbents which were used to enrich and 
identify the markers. 

One useful chip is the Normal Phase ProteinC3iip® array, available fix>m 
Ciphergen Biosystems, Inc. (Palo Alto, CA). The normal phase chip has a hydrophilic 

20 adsorbent comprising silicon oxide (Si02) on the substrate surface. Silicon oxide can be 
applied to the surface by any of a number of well known methods. These methods 
include, for example, vapor deposition, e.g.^ sputter coating. A preferred thickness for 
such a probe is about 9000 Angstroms. 

Another useful chip is the SAXl ProteinChip® array made by Ciphergen 

25 Biosystems, Inc. in Palo Alto, CA. The SAXl protein chips are fabricated fix>m Si02 
coated aluminum substrates. In the process, a suspension of quaternary ammonium 
polystryenemicrospheres in distilled water is deposited onto the surface of the chip (1 
mL/spot, two times). After air drying (room temperature, 5 minutes), the chip is rinsed 
with deionized water and air dried again (room temperature, 5 minutes). 

30 Another useful chip is the IMAC3 (Immobilized Metal Affinity Capture, 

nitrilotriacetic acid on surface) ProteinChip® array, also available from Ciphergen 
Biosystems, Inc. The chips are produced as follows: 5-Methacylamido-2-(N,N- 
biscaxfooxymethaylantiino)pentanoic acid (7.5 wt %),Acryloyltri- 
(hydroxymethyl)methylamine (7.5 wt %) and N J^'-methylenebisacrylamide (0.4 wt %) 
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are photo-polymerized using-(-)riboflavin (0.02 wt %) as a photo-initiator. The monomer 
solution is deposited onto a rough etched, glass coated substrate (0.4 mL, twice) and 
irradiated for 5 minutes with a near UV exposure system (Hg short arc lamp, 20 mW/cm^ 
at 365 nm). The surface is washed with a solution of sodiimi chloride (1 M) and then 
S washed twice with deionized water. 

The IMAC3 with Cu(II) is activated as follows. The surface is treated 
with a solution of CUSO4 (100 mM, 10 ^iL/spot) and mixed on a high frequency mixer for 
10 minutes. After removing the CUSO4 solution, the chips are rinsed with water and each 
spot is treated with a solution of sodium acetate (100 mM, pH 4.0, 10 jil per spot). The 
1 0 sodium acetate is removed and the surface is washed with a stream of deionized water (15 
sec/chip). 

Adsorbents can be placed on the probe substrate in continuous or 
discontinuous patterns* If continuous, one or more adsoibents can be placed on the 
substrate surface. If multiple types of adsorbents are used, the substrate surface can be 

1 5 coated such that one or more binding characteristics vary in one or two-dimensional 

gradient. If discontinuous, plural adsorbents can be placed in predetermined addressable 
locations on the substrate surface. The addressable locations can be arranged in any 
pattern, but are preferably in regular pattern, such as lines, orthogonal arrays, or regular 
curves (e.g., circles). Each addressable location may comprise the same or different 

20 adsorbent. The spots are "addressable" in that during mass spectrometry an energy 

source, such as a laser, may be directed to (i.e., **addresses") each spot differentially to 
desorb the analyte. 

The probes can be produced using any suitable methods depending on the 
selection of substrate materials and/or adsorbents. For example, the surface of a metal 

25 substrate can be coated with a material that allows d^vitization of the metal surface. 
More specifically, a metal surface can be coated with silicon oxide, titanium oxide or 
gold. Then surface can be derivatized with a bifunctional linker, one end of which can 
covalently bind with a functional group on the surface and the other end of which can be 
further derivatized with groups that function as an adsorbent. In another example, a 

30 porous silicon surface generated from crystalline silicon can be chemically modified to 
include adsorbents for binding markers. In yet another sample, adsorbents with a 
hydrogel backbone can be formed directly on the substrate surface by in situ 
polymerizing a monomer solution which comprises, e.g.^ substituted acrylamide 

16 
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monomers, substituted acrylate monomers, or derivatives thereof comprising a functional 
group of choice as an adsorbent. 

The probe substrate comprising an adsorbent contacts a sample. The 
sample is preferably a biological fluid sample. Examples of biological fluid samples 
include blood, serum, urine, prostatic fluid, seminal fluid, semen, seminal plasma and 
prostate tissue (e.g., epithelial tissue, including extracts thereof)- 

The sample can be solubilized in or admixed with an eluant. The probe 
substrate comprising an adsorbent then contacts the solution using any techniques 
including bathing, soaking, dipping, spraying, washing over, or pipetting, etc. Generally, 
a volume of sample containing from a few attomoles to 100 picomoles of marker in about 
I fil to 500 III is sufficient for binding to the adsorbent. 

The sample can contact the probe substrate comprising an adsorbent for a 
period of time sufficient to allow the marker to bind to the adsorbent. Typically, the 
sample and the substrate comprising the adsorbent are contacted for a period of between 
about 30 seconds and about 12 hours, and preferably, between about 30 seconds and 
about IS minutes. 

The temperature at which the sample contacts the probe substrate 
comprising an adsorbent can be a function of the particular sample and ttie selected probe. 
Typically, the sample is contacted to the probe substrate undo* ambient temperature and 
pressure conditions. For some samples, however, modified temperature (typically 4®C 
through ST'^C), and pressure conditions can be desirable, which conditions are 
determinable by those skilled in the art. 

After the probe substrate comprising an adsorbent contacts the sample or 
sample solution, it is preferred that unbound materials on the probe substrate surface are 
washed out so that only the boimd materials remain on the substrate surface. Washing a 
probe substrate siirface can be accomplished by, e.g., bathing, soaking, dippmg, rinsing, 
spraying, or washing the substrate surface with an eluant or a washing solution. A 
microfluidics process is preferably used when a washing solution such as an eluant is 
introduced to small spots of adsorbents on the probe. Typically, the washing solution can 
be at a temperature of between O'^C and lOO'^C, preferably between 4^*0 and 37'*C. 

Any suitable washing solutions or eluants can be used to wash the probe 
substrate surface. For example, organic solutions or aqueous solutions can be used. 
Preferably, an aqueous solution is used. Exemplary aqueous solutions include a HEPES 
buffer, a Tris buffer, a phosphate buffered saline, etc. The selection of a particular 
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washing solution or an eluant is dependent on other experimental conditions (e.g., types 
of adsorbents used or markers to be detected), and can be determined by those of skill in 
the art. For example, if a probe comprising a hydrophobic group and a sulfonate group as 
adsorbents {e.g., SAXl ProteinChip® array) is used, then an aqueous solution, such as a 
HEPES buffer, may be preferred. In another example, if a probe comprising a metal 
binding group as an adsorbent (eg., Cu(II) ProteinChip® array) is used, then an aqueous 
solution, such as a phosphate buffered saline, may be preferred. In yet another example, 
if a probe comprising a hydrophobic group (eg., H4 ProteinChip® array) is used, then 
water may be preferred as a washing solution. 

An energy absorbing molecule (e.g., in solution) can be applied to markers 
or other substances bound on the probe substrate surface. Spraying, pipetting, or dipping 
can be used. This can be done after imboxmd materials are washed off of the probe 
substrate surface. An energy absorbing molecule refers to a molecule that absorbs energy 
from an energy source in a gas phase ion spectrometer, thereby assisting desorption of 
markers or other substances from a probe surface. Exemplary energy absorbing 
molecules include cinnamic acid derivatives, sinapinic acid and dihydroxybenzoic acid. 

After the marker is boimd to the probe, it is detected using gas phase ion 
spectrometry. Markers or other substances bound to the adsorbents on flie probes can be 
analyzed using a gas phase ion spectrometer. The quantity and characteristics of the 
marker can be determined using gas phase ion spectrometry. Other substances in addition 
to the marker of interest can also be detected by gas phase ion spectrometry. 

In one embodiment, a mass spectrometer can be used to detect markers on 
the probe. In a typical mass spectrometer, a probe with a marker is introduced into an 
inlet system of the mass spectrometer. The analyte is then desorbed by a desorption 
source such as a laser, fast atom bombardment, or high energy plasma. The generated 
desorbed, volatilized species consist of preformed ions or neutrals which are ionized as a 
direct consequence of the desorption event. Generated ions are collected by an ion optic 
assembly, and then a mass analyzer disperses and analyzes the passing ions. The ions 
exiting the mass analyzer are detected by a detector. The detector then translates 
information of the detected ions into mass-to-charge ratios. Detection of the presence of 
a marker or other substances will typically involve detection of signal intensity. This, in 
turn, can reflect the quantity and character of a marker bound to the probe. 

In a preferred embodiment, a laser desorption time-of-flig(ht mass 
spectrometer is used with the probe of the present invention* In laser desoxption mass 
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Spectrometry, a probe with a boxind analyte is introduced into an inlet system. The 
analyte is desorbed and ionized into the gas phase by laser from the ionization soiirce. 
The ions generated are collected by an ion optic assembly, and then in a time-of-flight 
mass analyzer, ions are accelerated through a short high voltage field and let drift into a 
high vacuum chamber. At the far end of the high vacuimi chamber, the accelerated ions 
strike a sensitive detector surface at a different time. Since the time-of-flight is a function 
of the mass of the ions, the elapsed time between ion formation and ion detector impact 
can be used to identify the presence or absence of molecules of specific mass to charge 
ratio. As any person skilled in the art understands, any of these components of the laser 
desorption time-of-flight mass spectrometer can be combined with other components 
described herein in the assembly of mass spectrometer that employs various means of 
desorption, acceleration, detection, measurement of time, etc. 

In another embodiment, an ion mobility spectrometer can be used to detect 
and characterize a marker. The principle of ion mobility spectrometry is based on 
different mobility of ions. Specifically, ions of a sample produced by ionization move at 
different rates, due to their difference in, e.g.^ mass, charge, or shape, through a tube 
under the influaice of an electric field. The ions (typically in the form of a current) are 
registered at the detector which can then be used to identify a marker or other substances 
in the sample. One advantage of ion mobility spectrometry is that it can operate at 
atmospheric pressure. 

In yet another embodiment, a total ion current measuring device can be 
used to detect and characterize markers. This device can be used when the probe has a 
surface chemistry that allows only a single type of marker to be bound. When a single 
type of marker is boimd on the probe, the total current generated from the ionized marker 
reflects the nature of the marker. The total ion current produced by the marker can then 
be compared to stored total ion current of known compounds. Characteristics of the 
marker can then be determined. 

Data generated by desorption and detection of markers can be analyzed 
with the use of a programmable digital computer. The computer program generally 
contains a readable medium that stores codes. Certain code can be devoted to memory 
that includes the location of each feature on a probe, the identity of the adsoifoent at that 
feature and the elution conditions used to wash the adsorbent. Using this information* the 
program can then identify the set of features on the probe defining certain selectivity 
characteristics {e.g.^ types of adsorbent and eluants used). The computer also contains 
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code that receives as input, data on the strength of the signal at various molecular masses 
received from a particular addressable location on the probe. This data can indicate the 
number of markers detected, optionally including the strength of the signal and the 
determined molecular mass for each marker detected. 
5 Data analysis can include the steps of determining signal strength (e.g., 

height of peaks) of a marker detected and removing "outliers" (data deviating from a 
predetermined statistical distribution). For example, the observed peaks can be 
normalized, a process whereby the height of each peak relative to some reference is 
calculated. For example, a reference can be backgroxmd noise generated by instrument 

10 and chemicals (e.g^., energy absorbing molecule) which is set as zero in the scale. Then 
the signal strength detected for each marker or other substances can be displayed in the 
form of relative intensities in the scale desired (eg., 100). Alternatively, a standard may 
be admitted with the sample so that a peak from the standard can be used as a reference to 
calculate relative intensities of the signals observed for each marker or other markers 

IS detected. 

The computer can transform the resulting data into various formats for 
displaying. In one format, referred to as "spectrum view or retentate map," a standard 
spectral view can be displayed, wherein the view depicts the quantity of marker reaching 
the detector at each particular molecular weight. In another format, referred to as *^eak 

20 map," only the peak height and mass information are retained from the spectrum view, 
yielding a cleaner image and ^labling markers with nearly identical molecular weights to 
be more easily seen. In yet another format, referred to as "gel view," each mass from the 
peak view can be converted into a grayscale image based on the height of each peak, 
resulting in an appearance similar to bands on electrophoretic gels. In yet another format, 

25 referred to as "3-D overlays," several spectra can be overlaid to study subtle changes in 
relative peak heights. In yet another format, referred to as "difference map view," two or 
more spectra can be compared, conveniently highlighting unique markers and markers 
which are up- or down-regulated between samples. Marker profiles (spectra) from any 
two samples may be compared visually. 
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III. METHODS OF DETECTING MARKERS 

A. Characterization Of The Markers 

A set of markers was identified that is useful in distinguishing prostate 
cancer, prostate intraepithelial neoplasia, benign prostate hyperplasia and a negative 
diagnosis. The set was identified in cell lysate fi-om prostate epithelial tissue. 

1. Cell Lysate Marker Set 

A set of markers useful for diagnosing prostate cancer, prostate 
intraepithelial neoplasia, benign prostate hyperplasia and a negative diagnosis has the 
following molecular weights as determined by mass spectrometry: Marker EPl: 3448 ± 
19 Da, Marker EP2: 4036 ± 22 Da, Marker EP3: 4361 ± 24 Da, Marker EP4: 4413 ± 24 
Da, Marker EPS: 4639 ± 26 Da, Marker EP6: 4749 ± 26 Da, Marker EP7: 4827d= 27 Da, 
Marker EPS: 5666 ± 31 Da, Marker EP9: 8445 ± 46 Da, Marker EPIO: 11744 ± 65 Da, 
Marker EPl 1 : 14696 ± 81 Da, Marker EP12: 24184 ± 133 Da, Marker EP13: 48308 ± 
266 Da, and Marker EP14: 53830 ± 296 Da. 

These markers are further characterized as follows: Cell lysate firom 
prostate epithelial cells isolated by laser capture microdissection was fi:actionated and 
analyzed by retentate chromatography. In this method, test samples were applied to a 
substrate comprising a copper chelate adsorbent The adsorbents were washed with a first 
eluent of 20 mM HEPES, pH 8.0, 0.1% TritonX 100, to allow binding of proteins to the 
chip, and with a second eluent of phosphate bufifered saline (PBS) to remove non-bound 
materials. Markers retained on the adsorbents were detected by SELDI mass 
spectrometry. The fourteen markers could be resolved by the metal chelate adsorbent. 

B. Sample Sources For Markers 

These maikers can be detected in many different samples derived fix>m a 
patient or subject. The sample is preferably a biolo^cal flmd sample. Examples of 
biological fluid samples useful in this invention include blood, serum, urine, prostatic 
fluid, seminai fluid, semen, seminal plasma and prostate tissue ie.g.^ epithelial tissue, 
including extracts thereof). 

Because the markers in the Marker Set were discovered in cell lysate fi^om 
prostate epithelium, this tissue is a preferred sample source for the methods of this 
invention. More particularly, the samples were collected by laser capture microdissection 
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of this tissue. Therefore, this is a preferred method of obtaining sample for the methods 
of this invention. 



C. Detection of Markers 

After a sample is obtained, any suitable method can be used to detect the 
5 marker in a sample from a subject being tested. For example, gas phase ion spectrometry 
or an immunoassay can be used. 

1. Mass Spectrometry 

In a preferred embodiment, the markers of this invention are detected 
using mass spectrometry, more preferably using gas phase ion spectrometry and, still 

10 more preferably, using surface-enhanced laser desorption/ionization mass spectrometry 
("SELDr'). SELDI is an improved method of gas phase ion spectrometry for 
biomolecules. In SELDI, the surface on which the analyte is applied plays an active role 
in the analyte capture, desorption and/or desorption. 

One popular method of gas phase ion spectrometry for biomolecules is 

1 5 MALDI (matrix-assisted laser desorption/ionization) mass spectrometry. In MALDI, the 
analyte is Qrpically mixed with a matrix material that, upon drying, forms crystals that 
capture the analyte. The matrix material absorbs energy from the energy source which 
otherwise would fragment the bimolecular analytes. 

20 a) Preparation of sample 

(i) Pre-fractionation 

In one embodiment, the sample can be pre-fiactionated before being 

subjected to gas phase ion spectrometry. Pre-fractionation has the advantage of providing 

a less complex sample for analysis. On the other hand, it introduces an extra step in the 
25 analytic process that could be unattractive in, for example, a clinical setting. Samples can 

be pre-fractionated by any means known in the art, including, without limitation, size 

fractionation and chromatographic fractionation. 

In one embodiment, seminal plasma samples can be pre-fractionated 

before analysis by gas-phase ion spectrometry. A preferred method of fraction includes a 
30 first fractionation by gel exclusion chromatography. Sizing columns which exclude 

molecules whose molecular mass is greater than 30 kDa are particularly useful for this. 

Fractions of various sizes then can be examined directly or subjected to a second 
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fractionation step based on anion exchange chromatography. Using an anion exchange Q 
spin column, markers can be eluted using a low strength buffer (e.g., about 10 mM to 50 
mM Tris, HEPES or PBS) with salt at low to medium concentration (e.g., about 0.1 M to 
0.6 M) and a non-ionic detergent at low concentration (e.g., TritonX 100 at about 0.05 to 
0.2%). A particularly useful buffer is 20 mM Tris, 0.5 M NaCl and 0.1% TritonX 100. 
In another embodiment, the markers are eluted using a pH gradient. 

(ii) Retentate Chromatography 

In another embodiment, the sample is fractionated on abio- 
chromatographic chip by retentate chromatography before gas phase ion spectrometry. A 
preferred chip is the Protein Chip® array available from Ciphergen Biosystems, Inc. 
(Palo Alto, CA). As described above, the chip or probe is adapted for use in a mass 
spectrometer. The chip comprises an adsorbent attached to its surface. This adsorbent 
can function, in certain applications, as an in situ chromatogr^hy resm. In basic 
operation, the sample is applied to the adsorbent in an eluent solution. Molecules for 
which the adsorbent has affinity under the wash condition bind to the adsoibent. 
Molecules that do not bind to the adsorbent are removed with the wash. The adsorbent 
can be further washed under various levels of stringency so that analytes are retained or 
eluted to an appropriate level for analysis. Then, an energy absorbing molecule can be 
added to the adsorbent spot to further facilitate desorption and ionization. The analyte is 
detected by desorption from the adsorbent, ionization and direct detection by a detector. 
Thus, retentate chromatography differs from traditional chromatography in that the 
analyte retained by the affinity material is detected, whereas in traditional 
chromatography, material that is eluted from the aflBnity material is detected. 

A useful adsorbent for the markers of the Marker Set is a metal chelate 
adsorbent and, in particular, copper. This surface also is usefully washed with a pH 
neutral buffer such as PBS. A preferred surface is IMAC3 from Ciphergen Biosystems, 
Inc. IMAC3 comprises a coppor chelate adsorbent. 

Another useful adsorbent for any of the markers of this invention is an 
antibody that specifically binds the marker. Chips comprising antibodies that bind to one 
or more markers are particularly useful for removing non-markers which do not bind to 
the antibodies and which function as '*noise" in the detection process. 

As will be evident to anyone skilled in the art, different markers may be 
more easily resolved using different combinations of adsorbents and eluants. 
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(iii) Mixing the sample with an energy absorbing 
matrix 

In MALDI applications the sample to be analyzed is mixed with an energy 
absorbing matrix prior to ionization and mass analysis. The sample/matrix mixture is 
then applied to the surface of an inert mass spectrometer probe. Suitable matrix materials 
are well know to those of skill in the art and include 3-hydroxypicolinic acid (3-hydroxy- 
2-pyridinecarboxylic acid), nicotinic acid, N-oxide, 2'-6'-dihydroxyacetophenone, gentisic 
acid (2,5-dihydroxybenzoic acid), a-cyano-4-hydroxycinnamic acid, feralic acid (4- 
hydroxy-3-methoxycinnamic acid) and sinapinic acid (3,5-dimethoxy-4-hydroxycinnamic 
acid). 

The resolving power of MALDI is limited by the complexity of the sample 
being analyzed. Therefore prefractionation or otherwise purifying the sample prior to 
MALDI analysis is preferred. 

(iv) Modification of Marker Before Analysis 

hi another embodiment, the markers are modified before detection in order 
to alter their molecular weigiht. These methods may decrease ambiguity of detection. For 
example, the markers may be subject to proteolytic digestion before analysis. Any 
protease can be used. Proteases such as trypsin, that are likely to cleave the markers into 
a discrete nxmxber of fragments are particularly useful. The fragments that result from 
digestion function as a fingerprint for the markers, thereby enabling their detection 
indirectly. This is particularly useful where there are markers with similar molecular 
masses that might be confiised for the marker in question. Also, proteolytic 
fiagmentation is useful for high molecular weight markers because smaller markers are 
more easily resolved by mass spectrometry. 

In another embodiment, the markers can be modified by the attachment of 
a tag of particular molecular wei^t that bind specifically to molecular markers, further 
distinguishing them. 

b) Performance of laser desorption/ionization mass 
spectrometry 

After the marker is detected by mass spectrometry, preferably gas phase 
ion spectrometry, a test amount of marker can be determined. For example, a signal is 
displayed at the molecular weight of the marker of interest Based on the strength or 
magnitude of the displayed signal, the amount of marker in a sample being tested can be 
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determined. It is noted that the test amount of marker in a sample need not be measured 
in absolute units, but can be in relative units as long as it can be compared qualitatively or 
quantitatively to a control amount of a marker. For example, the amount of the marker 
detected can be displayed in terms of relative intensity based on the background noise. 

S Preferably, the test amount and the control amount of markers are measured under the 
same conditions. 

If desired, the absolute amount of a marker can be determined by 
calibration. For example, a purified, known marker can be added in increasing amounts 
to different spots of adsorbents on the probe surface. Then peaks from each spot can be 

10 obtained and plotted in a graph against the concentration of known marker protein at each 
spot. From the peak intensity vs. concentration plot, the absolute amount of a marker in 
any sample being tested can be determined. 

2. Immunoassay Detection 

In another embodiment of the detection method, an immunoassay can be 

IS used to qualitatively or quantitatively detect and analyze markers in a sample. This 
method comprises: (a) providing an antibody that specifically binds to a maxker, (b) 
contacting a sample with the antibody; and (c) detecting the presence of a complex of the 
antibody bound to the marker in the sample. 

To prepare an antibody that specifically binds to a marker, piuified 

20 markers or their nucleic acid sequences can be used. Nucleic acid and amino acid 

sequences for markers can be obtained by fiirther characterization of these markers. For 
example, each marker can be peptide mapped with a number of enzymes (e.g., trypsin, 
V8 protease, etc.). The molecular weights of digestion firagments from each marker can 
be used to search the databases, such as SwissProt database, for sequences that will match 

25 the molecular weights of digestion firagments generated by various enzymes. Using this 
method, the nucleic acid and amino acid sequences of other markers can be identified if 
these markers are known proteins in the databases. 

Alternatively, the proteins can be sequenced using protein ladder 
sequencing. Protein ladders can be generated by, for example, firagmenting the molecules 

30 and subjecting fragments to enzymatic digestion or other methods that sequentially 

remove a single amino acid from the end of the firagment. Methods of preparing protein 
ladders are described, for example, in International Publication WO 93/24834 (Chait et 
al.) and United States Patent 5,792,664 (Chait et al.). The ladder is then analyzed by 
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mass spectrometry. The difference in the masses of the ladder fragments identify the 
amino acid removed from the end of the molecule. 

If the markers are not known proteins in the databases, nucleic acid and 
amino acid sequences can be detemiined with knowledge of even a portion of the amino 

5 acid sequence of the marker. For example, degen^ate probes can be made based on the 
N-terminal amino acid sequence of the marker. These probes can then be used to screen a 
genomic or cDNA library created from a sample from which a marker was initially 
detected. The positive clones can be identified, amplified, and their recombinant DNA 
sequences can be subcloned using techniques which are well known. See^ eg.. Current 

1 0 Protocols for Molecular Biology (Ausubel et al , Green Publishing Assoc. and Wiley- 

Interscience 1989) and Molecular Cloning: A Laboratory Manual^ 2nd Ed. (Sambrook et 
al.. Cold Spring Harbor Laboratory, NY 1989). 

Using the purified markers or their nucleic acid sequences, antibodies that 
specifically bind to a marker can be prepared using any suitable methods known in the 

15 art. See, e.g., Coligan, Current Protocols in Immunology (1991); Harlow & Lane, 

Antibodies: A Laboratory Manual (1988); Coding, Monoclonal Antibodies: Principles 
and Practice (2d ed. 1986); and Kohler & Milstein, Nature 256:495-497 (1975). Such 
techniques include, but are not limited to, antibody preparation by selection of antibodies 
fix)m libraries of recombinant antibodies in phage or similar vectors, as well as 

20 preparation of polyclonal and monoclonal antibodies by immunizing rabbits or mice {see, 
eg., Huse et al. Science 246:1275-1281 (1989); Ward et al. Nature 341:544-546 
(1989)). 

After the antibody is provided, a marker can be detected and/or quantified 
using any of a number of well recognized immimological binding assays (see, e.g., U.S. 
25 Patents 4,366,241; 4,376,110; 4,517,288; and 4,837,168). Useful assays mclude, for 
example, an enzyme immune assay (EIA) such as enzyme-linked immxmosorbent assay 
(BLISA), a radioimmune assay (RIA), a Western blot assay, or a slot blot assay. For a 
review of the general inununoassays, see atso. Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993); Basic and Clinical Immunology (Stites & Teir, 
30 eds.,7thed. 1991). 

Generally, a sample obtained firom a subject can be contacted with the 
antibody that specifically binds the marker. Optionally, the antibody can be fixed to a 
solid support to facilitate washing and subsequent isolation of the complex, prior to 
contacting the antibody with a sample. Examples of solid supports include glass or 
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plastic in the form of, e.g., a microtiter plate, a stick, a bead, or a microbead. Antibodies 
can also be attached to a probe substrate or ProteinChip® array described above. (See 
e.g. Xiao et al. Cancer Research 62: 6029-6033 (2001)) The sample is preferably a 
biological fluid sample taken from a subject. Examples of biological fluid samples 

5 include blood, serum, urine, prostatic fluid, seminal fluid, semen, seminal plasma and 
prostate tissue (e.g., epithelial tissue, including extracts thereof). In a preferred 
embodiment, the biological fluid comprises seminal plasma. The sample can be diluted 
with a suitable eluant before contacting the sample to the antibody. 

After incubating the sample with antibodies, the mixture is washed and the 

10 antibody-marker complex formed can be detected. This can be accomplished by 

incubating the washed mixture with a detection reagent. This detection reagent may be, 
e.g., a second antibody which is labeled with a detectable label. Exemplary detectable 
labels include magnetic beads (e.g., DYNABEADS™), fluorescent dyes, radiolabels, 
enzymes (e.g., horse radish peroxide, alkaline phosphatase and others conmionly used in 

15 an ELISA), and colorimetric labels such as colloidal gold or colored glass or plastic 
beads. Alternatively, the marker in the sample can be detected using an indirect assay, 
wherein, for example, a second, labeled antibody is used to detect bound marker-specific 
antibody, and/or in a competition or inhibition assay wherein, for example, a monoclonal 
antibody which binds to a distinct epitope of the marker are incubated simultaneously 

20 with the mixture. 

Throughout the assays, incubation and/or washing steps may be required 
after each combination of reagents. Incubation steps can vary from about 5 seconds to 
several hours, preferably from about 5 minutes to about 24 hours. However, the 
incubation time will depend upon the assay format, marker, volume of solution, 

25 concentrations and the like. Usually the assays will be carried out at ambient 

temperature, although they can be conducted over a range of temperatures, such as lO^C 
to 40^C. 

The immunoassay techniques are well-known in the art, and a general 
overview of the applicable technology can be found in Harlow & Lane, supra. 
30 The immunoassay can be used to determine a test amount of a marker in a 

sample from a subject. First, a test amount of a marker in a sample can be detected using 
the inmiunoassay methods described above. If a marker is present in the sample, it will 
form an antibody-marker complex with an antibody that specifically binds the marker 
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under suitable incubation conditions described above. The amount of an antibody-marker 
complex can be determined by comparing to a standard. As noted above, the test amount 
of marker need not be measured in absolute units, as long as the unit of measurement can 
be compared to a control amount 

IV. METHODS OF DIAGNOSIS USING MARKERS 

The markers in the Marker Set may be used alone, in combination with 
other markers in the set, or with other markers entirely {e.g.y PSA or prostate specific 
membrane antigen) to aid in the diagnosis of prostate cancer, prostate intraepithelial 
neoplasia, benign prostate hyperplasia or a negative diagnosis. Used together, they tend 
to provide more information for the diagnostician, increasing the percentage of tme 
positive and trae negative diagnoses and decreasing the percentage of false positive or 
false negative diagnoses, than a single marker alone. 

In certain embodiments, it is useful to detect the presence or absence of a 
marker, without quantifying the amount of marker. This information is useful because 
the markers show differences in firequency of detection between PCA, PIN, BPH and 
negative diagnosis. In other embodiments, detecting the presence or absence of the 
markers can also involve quantifying the markers. This information is useful for several 
purposes. First, individual markers may be sUghtly detectable in certain states based on a 
weak peak, but easily detectable in other states based on a strong peak. Second, certain of 
the markers are fi-equently detected in all states, but their relative ratio differs 
dramatically among PCA PIN, BPH or normal samples. 

It is, of course, imderstood by diagnosticians that diagnostic tests are 
measured by their degree of specificity and sensitivity. Most diagnostic tests suffer firom 
some unperfection m their ability to provide a positive result for every true positive 
diagnosis, as well as some imperfection in their ability to provide a negative result for 
every true negative diagnosis. However, tests which are not perfectly specific or sensitive 
are useful in diagnosis because they render certain diagnoses more likely than not. For 
example, markers EPS and EP12 are detected in 43% of all samples positive for PCA and 
0% of negative samples. Thus, detection of EPS or EP12 is useful in that it provides a 
probable diagnosis of PCA. Failure to detect EPS or EP12 also is useful in that it 
provides a probable negative diagnosis. Furthermore, used in combination with other 
markers, the diagnostician can increase their confidence in a PCA or negative diagnosis. 
With this in mind, the markers have firequency distributions as follows. 
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A. Cell Lysate Marker Set 

Table 1 is a comparison of the relative abundance levels of markers EPI to 
EP14 in BPH, PIN, and PC A cell lysates. The relative quantity can be determined in gas 
phase ion spectrometry either by peak height or peak area. Preferably, this determination 
5 is normalized with respect to an internal standard, which is present in all samples. 
Markers EP2, EPS, EPS, and EP6 occur in a higher abundance in both PIN and PCA 
samples. Therefore, detection of two or three of these markers is positively correlated 
with a pathologic diagnosis. EPS occurs in higher abimdance in BPH samples. 
Therefore, detection of this marker is positively correlated with a normal or benign 
10 diagnosis. 

Table 2 displays the percent of BPH, PIN, or PCA prostate cell lysates 
found to be over- or under-expressing markers EP1-EP14. Marker EP 1 appears to be 
expressed in all cell types but is present in increasing amounts in BPH, PIN, and PCA cell 
lysates. EPIO appears to be under-expressed in BPH and PIN cell lysates and a majority 
IS of the PCA cell lysates. EP13 appears to be under-expressed in a majority of the PIN and 
PCA cell lysates. 

Table 3 shows the statistical analysis of the difFermtial expression of 
selected proteins as determined by the Wilcoxon signed ranks test. The differential 
expression of EP2, EP3, and EP6 was significant in differentiating PIN and/or PCA fix>m 
20 non-canc^ cells; while the differential expression of EPS was significant in 
distinguishing BPH from PIN and PCA. 

Table 4 displays the sensitivity and specificity of single markers and 
combination of markers found in PIN and PCA cell lysates. Combining markers EP2 and 
EPS, EP3 and EPS, or EP3 and EP6 increased the sensitivity for identifying PIN or PCA 
2S samples. Therefore, detection of these combinations of markers is positively correlated 
with a pathological diagnosis. 

Table S shows that by combining the expression levels of seven 
biomarkers, EP2, EP3, EP4, EPS. EP6, EPS, and PSA, a specificity of 93.3% and 
sensitivity of 93.S%, with an overall accuracy of 93.S% was achieved in differentiating 
30 normal (N/BPH) fi^om cancer (PIN/PCA). 

B. Detection of Markers: Comparison with Controls 

Detection of markers can usefully involve comparing the test data to a 
control. The control can be useful for calibration of '"normal" amounts and for 
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quantifying the amount of a marker against a normal or control amoimt. For example, it 
can be useful to know whether a marker present in both normal and pathologic conditions 
is increased by 1.5 fold, by 2 fold, by 5 fold, or by 10 fold compared to the control 
amount. 

In one embodiment, the control amount can be an amount of a marker 
present in comparable samples from normal patients. The control amount is measured 
under the same or substantially similar experimental conditions as in measuring ttie test 
amount. For example, if a test sample is obtained from a subject's seminal plasma and a 
marker is detected using a particular probe, then a control amount of the marker is 
preferably determined from a seminal plasma sample of a patient using the same probe. It 
is preferred that the control amoimt of marker is determined based upon a significant 
number of samples from subjects who do not have prostate cancer (e,g.y BPH or negative 
diagnosis patients) so that it reflects variations of the marker amoimts in that population. 

C. Correlatioii By Computer 

Data generated by mass spectrometry is usefully analyzed by computer 
software. The software can comprise code that converts signal from tiie mass 
spectrometer into computer readable form. The software also can include code that 
applies an algorithm to the analysis of the signal to determine whether the signal 
represents a "peak" in the signal corresponding to a marker of this invention, or other 
useful markers. The software also can include code that executes an algorithm that 
compares signal from a test sample to a typical signal characteristic of **normal," "BPH" 
and •TCA" and determines the closeness of fit between the two signals. The software 
also can include code indicating which the test sample is closest to, thereby providing a 
probable diagnosis. The software can also include code that executes an algorithm which 
comprises an artificial intelligence program. Examples of artificial intelligence programs 
include fiizzy logic, cluster analysis or neural networks. Executing the artificial 
intelligence program can include code indicating that the test sample correlates most 
closely with a normal, BPH, PIN or PCA sample, thereby providing a probable diagnosis. 

Data generatidn in mass spectrometry begins with the detection of ions by 
an ion detector Ions that strike the detector generate an electric potential that is digitized 
by a high speed time-array recording device that digitally captures the analog signal. 
Ciphergen's ProteinChip® system employs an analog-to-digital converter (ADC) to 
accomplish this. The ADC integrates detector output at regularly spaced time intervals 
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into time-dependent bins. The time intervals typically are one to four nanoseconds long. 
Furthermore, the time-of-flight spectrum ultimately analyzed typically does not represent 
the signal from a single pulse of ionizing energy against a sample, but rather the sum of 
signals from a number of pulses. This reduces noise and increases dynamic range. This 

5 time-of-flight data is then subject to data processing. In Ciphergen's ProteinChip® 
software, data processing typically includes TOF-to-M/Z transformation, baseline 
subtraction, high frequency noise filtering. 

TOF-to-M/Z transformation involves the application of an algorithm that 
transforms times-of-flight into mass-to-charge ratio (M/Z). In this step, the signals are 

10 converted from the time domain to the mass domain. That is, each time-of-flight is 
converted into mass-to-charge ratio, or M/Z. CaUbration can be done internally or 
externally. In internal calibration, the sample analyzed contains one or more analytes of 
knovm M/Z. Signal peaks at times-of-flight representing these massed analytes are 
assigned the known M/Z. Based on these assigned M/Z ratios, parameters are calculated 

15 for a mathematical function that converts times-of-flight to M/Z. In external calibration, 
a function that converts times-of-flight to M/Z, such as one created by prior intemal 
calibration, is applied to a time-of-flight spectrum without the use of intemal calibrants. 
Baseline subtraction improves data quantification by eliminating artificial, reproducible 
instrument of&ets that perturb the spectrum. It involves calculating a spectrum baseline 

20 using an algorithm that incorporates parameters such as peak width, and then subtracting 
the baseline fix>m the mass spectrum. 

High frequency noise signals are eliminated by the application of a 
smoothing function. A typical smoothing function applies a moving average function to 
each time-dependent bin. In an improved version, the moving average filter is a variable 

25 width digital filter in which the bandwidth of the filter varies as a function of, e.g., peak 
bandwidth, generally becoming broader with increased time-of-flight. See, e.g., WO 
00/70648, November 23, 2000 (Gavin et al., "Variable Width Digital Filter for Time-of- 
flight Mass Spectrometry*)* 

Analysis generally involves the identification of peaks in the spectrum that 

30 represent signal from an analyte. Peak selection can, of course, be done by eye. 

However, software is available as part of Ciphergen's ProteinChip® software that can 
automate the detection of peaks. In general, this software functions by identifying signals 
having a signal-to-noise ratio above a selected threshold and labeling the mass of the peak 
at the centroid of the peak signal. In one useful application many spectra are compared to 
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identify identical peaks present in some selected percentage of the mass spectra. One 
version of this software clusters all peaks appearing in the various spectra within a 
defined mass range, and assigns a mass (M/Z) to all the peaks that are near the mid-point 
of the mass (M/Z) cluster. 
5 Peak data fix>m one or more spectra can be subject to further analysis by, 

for example, creating a spreadsheet in which each row represents a particular mass 
spectrum, each column represents a peak in the spectra defined by mass, and each cell 
includes the intensity of the peak in that particular spectrum. Various statistical or pattern 
recognition approaches can applied to the data. 

10 The spectra that are generated in embodiments of the invention can be 

classified using a pattern recognition process that uses a classification model. In general, 
the spectra will represent samples fi-om at least two different groups for which a 
classification algorithm is sought. For example, the groups can be pathological v. non- 
pathological (e.g., cancer v. non-cancer), drug responder v. drug non-responder, toxic 

15 response v. non-toxic response, progressor to disease state v. non-progressor to disease 
state, phenotypic condition present v. phenotypic condition absent. 

In some embodiments, data derived firom the spectra (e.g., mass spectra or 
time-of-flight spectra) that are generated using samples such as "known samples" can 
then be used to '^ain" a classification model. A "known sample" is a sample that is pre- 

20 classified. The data that are derived firom the spectra and are used to form the 
classification model can be referred to as a "training data set". Once trained, the 
classification model can recognize patterns in data derived firom spectra generated using 
unknown samples. The classification model can then be used to classify the unknown 
samples into classes. This can be xisefiil, for example, in predicting whether or not a 

25 particular biological sample is associated with a certain biological condition (e.g., 
diseased vs. non diseased). 

The training data set that is used to form the classification model may 
comprise raw data or pre-processed data. In some embodiments, raw data can be 
obtained directly firom time-of-flight spectra or mass spectra, and then may be optionally 

30 **pre-processed" as described above. 

Classification models can be formed using any suitable statistical 
classification (or "learning") method that attempts to segregate bodies of data into classes 
based on objective parameters present in the data. Classification methods may be either 
supervised or unsupervised. Examples of supervised and unsupervised classification 
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processes are described in Jain, "Statistical Pattern Recognition: A Review", IEEE 
Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, January 2000, 
which is herein incorporated by reference in its entirety. 

In supervised classification, training data containing examples of known 
S categories are presented to a learning mechanism, which learns one more sets of 

relationships that define each of the known classes. New data may then be applied to the 
learning mechanism, which then classifies the new data using the learned relationships. 
Examples of supervised classification processes include linear regression processes (e.g., 
multiple linear regression (MLR), partial least squares (PLS) regression and principal 
10 components regression (PGR)), binary decision trees (e.g., recursive partitioning 

processes such as CART - classification and regression trees), artificial neural networks 
such as backpropagation networks, discriminant analyses (e.g., Bayesian classifier or 
Fischer analysis), logistic classifiers, and support vector classifiers (support vector 
machines). 

IS A prefOTed supervised classification method is a recursive partitioning 

process. Recursive partitioning processes use recursive partitioning trees to classify 
spectra derived firom unknown samples. Further details about recursive partitioning 
processes are in U.S. Provisional Patent Application Nos. 60/249,835, filed on November 
16, 2000, and 60/254,746, filed on December 11, 2000, and U.S. Non-Provisional Patent 

20 Application Nos. 09/999,081, filed November 15, 2001, and 10/084,587, filed on 
Febraary 25, 2002. All of these U.S. Provisional and Non Provisional Patent 
Applications are herein incorporated by reference in their entirety for all purposes. 

In other embodiments, the classification models that are created can be 
formed using xmsupervised learning methods. Unsupervised classification attempts to 

25 learn classifications based on similarities in the training data set, without pre classifying 
the spectra firom which the training data set was derived. Unsupervised learning methods 
include cluster analyses. A cluster analysis attempts to divide the data into ''clusters" or 
groups that ideally should have members that are very similar to each other, and very 
dissimilar to members of other clusters. Similarity is then measured using some distance 

30 metric, which measures the distance between data items, and clusters together data items 
that are closer to each other. Clustering techniques include the M ac(^een's K-means 
algorithm and the Kohonen's Self-Organizing Map algorithm. 

The classification models can be formed on and used on any suitable 
digital computer. Suitable digital computers include noicro, mini, or large computers 
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using any standard or specialized operating system such as a Unix, Windows™ or 
Linux™ based operating system. The digital computer that is used may be physically 
separate from the mass spectrometer that is used to create the spectra of interest, or it may 
be coupled to the mass spectrometer. 

The training data set and the classification models according to 
embodiments of the invention can be embodied by computer code that is executed or used 
by a digital computer. The computer code can be stored on any suitable computer 
readable media including optical or magnetic disks, sticks, tapes, etc., and can be written 
in any suitable computer programming language including C, C-H-, visual basic, etc. 

Using pattem recognition techniques discussed above, classification 
algorithms may be developed from the classification models. These are highly usefiil in 
aiding the diagnosis of prostate cancer, prostate intraepithelial neoplasia , or benign 
prostate hyperplasia versus a negative diagnosis, using the markers of the present 
invention. By screening samples fix)m sources known to have expressed each condition, 
the training data set, the pattem recognition techniques described herein identify 
difference patterns in the marker composition of diseased tissue relative to normal tissue. 
These difference patterns may comprise differences in the presrace and/or amount of a 
particular marker or markers. Difference patterns may be built from a statistical analysis 
of several test data sets using methods well known to those of skill in the art. The 
difference patterns generated may tiien be used to constract classification models, as 
described above, and used in comparison studies to diagnose disease states of patients. 



V. KITS 

In yet another aspect, the invention provides kits for aiding a diagnosis of 
prostate cancer, wherein the kits can be used to detect the markers of the present 
invention. For example, the kits can be used to detect any one or combination of markers 
described above, which markers are differentially present in samples of a prostate cancer 
patient, BPH and normal patients. The kits of the invention have many applications. For 
example, the kits can be used to differentiate if a subject has prostate cancer, BPH or has 
a negative diagnosis, thus aiding a prostate cancer diagnosis. In another example, the kits 
can be used to identify compoimds that modulate expression of the markers in in vitro 
prostate cells or in vivo animal models for prostate cancer. 
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In one embodiment, a kit comprises: (a) a substrate comprising an 
adsorbent thereon, wherein the adsorbent is suitable for binding a marker, and (b) a 
washing solution or instructions for making a washing solution, wherein the combination 
of the adsorbent and the washing solution allows detection of the marker using gas phase 
5 ion spectrometry. Such kits can be prepared from the materials described above, and the 
previous discussion of these materials {e.g., probe substrates, adsorbents, washing 
solutions, etc.) is fully applicable to this section and need not be repeated. 

In some embodiments, the kit may comprise a first substrate comprising an 
adsorbent thereon {e.g., a particle functionalized with an adsorbent) and a second 
10 substrate onto which the first substrate can be positioned to form a probe which is 

removably insertable into a gas phase ion spectrometer. In other embodiments, the kit 
may comprise a single substrate which is in the form of a removably insertable probe with 
adsorbents on the substrate. 

Optionally, the kit may fiirther comprise instructions for suitable 
1 5 operational parameters in the form of a label or a sq>arate insert. For example, the kit 
may have standard instructions informing a consumer how to wash the probe after a 
sample of seminal plasma is contacted on the probe. 

In another embodiment, a kit comprises (a) an antibody that specifically 
binds to a marker, and (b) a detection reagent. Such kits can be prepared fix>m the 
20 materials described above, and the previous discussion regarding the materials (e.^., 
antibodies, detection reagents, immobilized supports, etc.) is fiilly s^plicable to this 
section and need not be repeated. 

In either embodiment, the kit may optionally fiirther comprise a standard 
or control information so that the test sample can be compared with the control 
25 information standard to determine if the test amount of a marker detected in a sample is a 
diagnostic amount consistent with a diagnosis of prostate cancer. 

EXAMPLES 

The following examples show the identification of markers that have 
positive or negative correlations with prostate cancer, prostate intraepithelial neoplasia, or 
30 benign prostate hyperplasia. Markers were determined from samples of cell lysates 
isolated by laser capture microdissection. 
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VI. Tissue Specimens 

Prostate tissues were procured from consenting patients undergoing radical 
prostatectomy, following approved protocols. The tissues were processed immediately 
and stored in the Virginia Prostate Center's Bio-repository. Tissue pieces harvested for 
5 LCM were immediately embedded in OCT and stored at -80**C. One cryosection was 
stained with H&E and examined by a pathologist to identify cells of interest. Mirrored 
H&E stained sections, fixed in formalin and paraffin embedded, were also used to fiirther 
aid in the identification of specific cell types. Additional serial firozen sections at 8 
microns were used for microdissection. 

10 VII. Laser Capture Microdissection 

Populations of normal luminal epithelia. Benign Prostatic Hyperplasia 
(BPH), Prostate Intraepithelial Neoplasia (PIN)» and Prostate Adenocarcinoma (PCA) 
epithelial cells were microdissected firom fit)zen tissue sections using the PixCell^n 
Laser Capture Microdissection Microscope (Arcturas Engineering, Inc., Mountain View, 

15 CA) essentially as described by Emmert-Buck et al. Science, 274: 998-1001 (1996). See 
also United States Patent 5,843,644 (Liotta et al) and United States Patent 5,843,657 
(Liotta et al). The procedure for staining fi-ozen sections for LCM was followed with 
slight modifications: the hematoxylin step was omitted and protease inhibitors (Complete, 
Roche Biomedical, Indianapolis, IN) were added to the staining baths. A total of 1000 

20 laser shots totaling 3000-6000 cells were procured for each cell type. Matched benign 
and diseased epithelial cell types were harvested firom each prostate sample. In some 
cases, adjacent stroma cells were also prociured. Samples were standardized by total 
number of laser shots, and duplicate samples were captured from the same areas of each 
serial section to check reproducibility. 

25 

Vin. Laser Desorption/Ionizatioii Mass Spectrometry 

Cell lysates were immediately prepared after microdissection by adding 4 
H\ of the lysis buffer containing 20 mM HEPES pH 8.0 with 1% Triton X-100 directly on 
the LCM cap. Each lysate was diluted 1 : 10 in PBS buffer, giving a total volume of 40 ul. 
30 Lysates were vortexed for 10 minutes at 4'*C and centrifiiged briefly to remove cellular 
debris. The supernatant was added to an IMAC3 (Immobilized Metal Affinity Capture) 
ProteinChip® Array (Ciphergen Biosystems Inc. Fremont, CA), pretreated with 100 mM 
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CUSO4, following the manufacturer's instructions. A bioprocessor (Ciphergen 
Biosystems) was fitted on top of the chip arrays to permit the addition of the 40 ul 
sample. To control for variation, cell lysates harvested from each prostate tissue were 
analyzed on a single Biochip. The array was then incubated with the diluted lysate 
5 overnight with continuous shaking. After removal of the lysate, each spot was washed 
twice with PBS followed by a final water rinse. 



acid (Ciphergen Biosystems) in 50%(v/v) acetonitrile, 0.5%(v/v) trifluoroacetic acid was 
added to each spot. Time of flight mass spectra were generated in a Ciphergen Protein 
10 Biology System II (PBS II) by averaging 120 laser shots collected in the positive mode at 
laser settings of 225 and 280. Data was calibrated externally using purified peptide and 
protein standards. 



abundances for each peak were calculated as follows. The relative abundance of the 
IS proteins were subdivided into three classes: Low (+) equates to 1-30% of spectral scale. 
Medium (++) equates to 31-60%, and High (+++) equates to 60-100%. Numerical values 
were then assigned to the abundance levels (i-e. (+) equates to 33, (++) equates to 66, 
(+++) equates to 100) and averaged for each cell type for the purpose of representing the 
protein expression between prostate cell types in graphical form. 
20 Sensitivity is defined as the percent of diseased (BPH/PIN/PCA) cell types 

for which the biomarker of interest is present, (true positive/total number of diseased 
lysates x 100). Specificity is defined as the percent of cell types for which the biomarker 
of interest is not positive, (trae negative/total number of lysates without disease x 100), 

IX. Markers in Cell Lysates from Prostate Epithelial Tissue 
25 A. IMAC3 ProteinChip 



The arrays were allowed to air dry and a saturated solution of sinapinic 



Spectra were analyzed with the Ciphergen Peaks 2.1 software and relative 



30 



2. 



3. 



1. 



1. Protocol: 

Draw a hydrophobic ring around each adsorbent spot on the chip. 
Load lOjil 100 mM CUSO4. Incubate chip for 10 minutes in a moist 
chamber with shaking. 

Wash Chip with DI H2O 10 seconds, around spots and remove excess 



water. 
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4. Load \0\il 100 mM sodium acetate, pH 4.0. Incubate chip for 10 minutes 
in a moist chamber with shaking. 

5. Wash Chip with DI H2O and allow to air dry. 

6. Lyse microdissected cells in 4^1 of 20 mM HEPES 1% Triton X-100 

S directly on the cap. Pipette the lysis buffer up and down on top of the cells 

until lysis occurs. Dilute lysate 1:10 in PBS and vortex cell lysate 10 
minutes at 4^C and spin down. 

7. Spot the entire lysate (40^1) on spot and incubate in a moist chamber 
overnight with continuous shaking. 

10 8. Wash each spot 2x with 40^1 of PBS. 

9. Wash 1 X 40jil with water and air dry. 

10. Spot 2x 0.5^1 sins^inic acid in 50% acetonitrile» .5% trifluoroacetic acid. 

11. Airdry. 

2. SELDI Reader Data Collection 

15 1. Peaks were collected on the Ciphergen Biosystems PBSn at a laser 

intensity of 225 and 280 with a detector sensitivity of 10. 120 shots were 
collected and averaged. 
2. Data was calibrated externally using purified peptide and protein 
standards. 

20 B. Results 

Pure populations of matched benign (nomial or BPH), PIN (high grade) 
and/or PCA epithelial cells were obtained &om nine prostatectomy specimens examined. 
Stroma cells were also microdissected from a selected subset of tissues to compare to the 
epithelial cell profiles. A total of 80 cell lysates firom an average of 5000 cells/protein 

25 array were analyzed by SELDI time-of flight mass spectrometry. Processing the lysates 
on an immobilized metal affinity capture surface pre-treated with CUSO4 resolved an 
average of 70'protein or peptide peaks in the range from 3 kDa to 100 kDa. 

Figure 1 is a representative spectrum of the protein profile of a PCA cell 
lysate in the mass range of 3000-80,000 Da. The advantages of the SELDI technology in 

30 resolving small molecular weight protein or peptide species is evident, since the majority 
of peaks observed were below 20 kDa. Figure la shows an abimdant set of peaks 
observed as a triplet, with the predominant peak at m/z of 3459. This cluster was 
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observed in most cell lysates analyzed, stromal and epithelial, and has previously been 
identified in our laboratory as defensin ct-l from the urine of bladder cancer patients using 
a SELDI immunoassay (see e.g. Vlahou et aL, Am. J. Pathol, 158: 1491-1502, (2001)), 
A peak at m/z of 28445 (Figure Ic) is most likely free PSA, Serum albimiin (HSA), a 

5 peak at m/z of 66866 was also present in all cell lysates tested (see Figure Ic). 

Differential protein expression between cell types was evaluated by 
determining the abundance (i.e., over-expression or under-expression) of each peak 
relative to each matched cell type. Figure 2a is a an example of a group of peaks between 
4000 Da and 6000 Da showing differential expression between the epithelial cell types 

10 obtained from the same prostate tissue specimen. Three of these peaks (m/z 4030 (EP2), 
4358 (EP3), 4753 (EP6)) occur in greater frequency and abimdance in the PIN and PCA 
lysates. As a general rule, peptide or protein species found in greater abimdance in the 
PCA profiles, were also found in increasing amounts in the PIN profiles. Figure 2b 
shows file over-expression of the peak with an average m/z of 4749 (EP6) in the PIN and 

1 5 PCA lysates from three different prostatectomy specimens. Duplicate samples exhibited 
a high degree of reproducibility. In contrast, regions of hetm>geneity were present in the 
protein profiles derived firom two different foci of PCA in Patient #2 and two foci of PIN 
in Patient #3. Profile differences were also found in the BPH cell lysates. Figure 2c is an 
example of a peak at m/z of 5666 (EP8X which ^pears in grater abundance in the BPH 

20 epithelial cells when compared to the spectra of the other matched cell types. With few 
exceptions, the protein patterns found in BPH cell lysates more closely resembled the 
non[nal epithelial profile. 

The profiles of proteins greater than 10 kDa also demonstrated some 
differential expression patterns. One peak with an average m/z 28442 was found in lower 

25 abundance in the BPH, PIN and PCA epithelial cells compared to profiles of matched 
normal cell lysates. This mass is consistent with the molecular weight of free PSA 
(Belanger et al, Prostate 27:187-197 (1995)). An example of this decrease in 
intracellular PSA levels for PIN and PCA cells can be seen in Figure 3. A large PSA 
peak at 28393 Da was present in the nomial prostate epithelia but absent firom adjacent 

30 stroma cells. The PIN and PCA cells have a greatly reduced PSA peak. Normal 

epithelial cells procured directly adjacent to the PCA foci also exhibited a large amount of 
intracellular PSA. The 66838 Da peak is albumin and was present in all cell lysates at 
high abundance. 
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These comparative analyses suggested that several proteins showed 
differential expression between the four cell types. However, to more precisely select 
those with high biomarker potential, it was necessary to standardize peak expression 
levels throughout the matched populations. The relative abundance of the proteins were 
5 subdivided into three classes: Low (+) equates to 1-33% of spectral scale. Medium (++) 
equates to 33-66%, and High (+-H-) equates to 66-100%. Each individual spectrum was 
expanded in specific mass ranges to assist in determining the relative abundance of each 
observed peak. An in-house computer program was used to cluster the peaks to obtain an 
average mass for each peak with a mass error range of 0. 15%. From an average of 70 

10 peaks observed, 14 (20%) displayed expression differences between the epithelial cell 
profiles (Table 1). The scores in Table 1 were converted to numerical values and plotted 
in a histogram showing the average differences in expression levels between cell types 
subtracted from the expression level in normal epithelia (Figure 4). The over-expression 
and under-expression for these 14 protein/peptides was '^normalized" and evaluated for 

15 BPH, PIN and PCA cells. Generally, the PCA and PIN levels of expression for protein or 
peptides were similar. As shown in Figure 4, there were higher levels of a group of small 
molecular weight peptides/proteins between 3000 and 6000 Da in the PIN and PCA 
profiles. The first of these, 3448 Da, was found increased in BPH, PIN, and PCA cell 
lysates. Three peaks (4036 Da (EP2), 4361 Da (EP3), and 4749 Da (EP6)) were over- 

20 expressed in PIN and PCA with the highest level of over-expression in PCA lysates. 

Note that none of the BPH cell lyssAes exhibited the 4036 Da (EP2) peak. A summary of 
the percent of each cell type exhibiting those most significantly increased or decreased 
proteins is presented in Table 2. 
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Table 1: Comparison of the abundance levels* of proteins in lysates of LCM procured 
prostate cells (epithelial and stromal). ^Relative abundance of the proteins: + = 1-33% of 
spectral scale, ++ = 33-66%, +++ = 66-100%, 0 = peak not detected. 

mwfPa^ 3448 4036 4361 4413 4639 4749 4827 5666 8445 11744 14696 24184 28422 48308 S3830 
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. Two peaks (4639 Da (EPS) and 24184 Da (EP12)) were specific to PCA 
lysates in 43 % of the PCA samples tested. The peak at 4749 Da CEP6) was over- 
expressed in 67% and 57% of PIN and PCA lysates, respectively, with the highest overall 
peak expression observed in the PCA lysates. A 14696 Da (EPl 1) peak was present in 
56% of the PIN cell lysates. Also of note, was a large increase in expression of a peak at 
5666 Da ^8) found in 86% of the BPH profiles. A peak at m/z 4413 (EP4) and two 
high molecular weight proteins (48208 Da (EPI3) and 53836 Da (EP14)) were also found 
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to have increased expression in BPH cells. As shown in Figure 4, a few proteins were 
under-expressed in BPH, PIN, and PCA cells where compared to the expression levels in 
normal epithelial cells. Expression of the 5666 Da (EPS) peak, which was over-expressed 
in BPH profile, was reduced in 22% of PIN ad 71% of PCA cell lysates at significant 
5 levels (Table 2). In addition, decreased expression of an 1 1744 Da protein (EPIO) and 
PSA (28422 Da) was found in 56 % and 57% of the PIN and PCA cell lysates. The 
reduced expression of intracellular PSA was found in 56% and 57% of PIN and PCA cell 
lysates, respectively. 

10 Table 2: Percent of prostate cells lysates (BPH/PIN/PCA) showing differential expression 
of selected proteins {Le, greater or lesser abundance) when compared to the expression in 
matched normal prostate epithelial cell lysates. 

BPH PIN PCA 

15 % of sample % of sample % of sample 



average 


Greater 


Lesser 


Greater 


Lesser 


Greater 


Lesser 


TO/Z 


abundance 


abundance 


abundance 


abundance 


abundance 


abundance 
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44 
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44 
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The significance of the expression differences between cell types was 
evaluated using the Wilcoxon signed rank test (Table 3). E;q>ression diif^ences were 

20 found to be significant (p<.05) for BPH vs. PIN and BPH vs. PCA for over-expression of 
the 4036 Da (EP2) protein (Table 3). A comparison of N vs. PCA approaches 
significance (p=.059) for this peak. Similarly, significant differences were observed in N 
vs. PCA, and BPH vs. PIN in over-expression of the 4361 (EP3) peak. Over-expression 
of the 4749 Da (EP6) peak was significant for N vs. PIN and approaches significance for 

25 BPH vs. PIN (Table 3). The 5666 Da (EPS) protein was significantly over-expressed in 
N vs. BPH and, interestingly, is significantly under-expressed in BPH vs. PIN and BPH 
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VS. PC A. The under-expression of PSA was significant in N vs. PIN and approaches 
significance (p=.066) for N vs. PCA, 



Table 3: Wilcoxon signed ranks test of peak expression differences. 



Peak (Ds^ 


N vs. BPH 


N vs. PIN 


N vs. PCA 


BPH vs. 
PIN 


BPH vs. 
PCA 


PIN vs. 
PCA 


4036 


0.317 


0.083 


0.059 


0.025* 


0.039' 


0.102 


4361 


0.564 


0.059 


0.038' 


0.046» 


0.102 


0.083 


4413 


0.083 


0.157 


0.317 


0.655 


1.000 


0.317 


4639 


1.000 


1.000 


0.083 


1.000 


0.157 


0.083 


4749 


0.317 


0.034* 


0.102 


0.059 


0.109 


0.564 


5666 


0.038» 


1.000 


0.059 


0.024* 


0.039» 


0.408 


28422 


0.083 


0.041* 


0.066 


0.680 


0.180 


0.564 


(PSA) 















» significant p-values ( ^.05), based on over-expression as compared to N or BPH 
* significant p-values ( ^.05), based on under-expression as compared to N or BPH 



5 

Since no single peptide or protein was discovered to be present, or 
differentially expressed in all the PIN or PCA profiles, various combinations of selected 
proteins were evaluated to identify a panel of markers (expression levels) that could 
improve disease classification of each specific cell type. A biomarker combination was 

1 0 classified as positive if any marker in the combination was present in a sample, and 

negative if none of the markers were detected in a specimen. Since these markers were 
over-expressed in both PCA and PIN, the combination of the markers did not improve the 
specificity for each diseased cell type individually. Better discrimination was achieved 
by combining the benign cell types (N, BPH) and comparing fliem to diseased cell types 

1 5 (PIN, PCA). The sensitivity and specificity of each marker for benign versus disease was 
then calculated, both individually and in additive combinations. As shown in Table 4, by 
combining markers EP2 (m/z 4036) and EPS (m/z 4639), there was an improvement in 
the sensitivity for PCA from 71% to 86%, while maintaining a specificity of 100% for 
PIN or PCA combined. This combination of markers also identified 44% of the PIN 

20 samples. Combining marker EP3 (m/z 4361) and marker EPS (m/z 4639) increased the 
sensitivity for PCA to 100%, while the specificity decreased to 87% for PIN and PCA. 
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By combining markers EP3 and EP6 (m/z 4749), 100% of the PCA and 89% of the PIN 
samples could be correctly identified, with a slight decrease in specificity. Combinations 
involving three or more of the markers did not improve tiie overall specificity and 
sensitivity. The 5666 Da protein, (marker EPS, Table 4) had a sensitivity of 86% and 
5 specificity of 88% for detecting BPH. This marker also provided some discrimination 
between BPH and PIN lysates having 22% sensitivity and 68% specificity for PIN 
epithelia. 

Table 4: Sensitivity and specificity of single markers and combinations of markers for the 
10 detection of abnormal (diseased) cells. *The increase in abundance of each marker (EP2- 
EP6) foimd in PIN and PCA epithelial cells was determined from matched normal or 



BPH epithelial cell lysates. The sensitivity and specificity of a marker (EP8) found in 
abimdance in BPH samples when compared to PIN and PCA samples. 







Srasitivity 


Specificity 


Specificity 


Maiker 


E3g>ressioii* 


PCA 


PIN 


PCA 


PIN 


PIN or PCA 


EP2 


inciease of m/z 4036 


71% 


44% 


83% 


77% 


100% 


EP3 


increase of m/z of 4361 


71% 


44% 


7S% 


68% 


87% 


EPS 


presence of m/z 4639 


43% 


0% 


100% 




100% 


EP6 


increase of m/z of 4749 


S7% 


67% 


71% 


71% 


93% 


Combinatioii 


EP2 + EP3 


86% 


67% 


67% 


64% 


87% 




EP2 + EPS 


86% 


44% 


83% 


77% 


100% 




EP3 + EPS 


100% 


56% 


7S% 


68% 


87% 




EP2-t-EP6 


86% 


78% 


67% 


68% 


93% 




EP3 + EP6 


100% 


89% 


54% 


55% 


80% 




»5 + EP6 


86% 


67% 


71% 


68% 


93% 




EP2 + EP3 + EPS 


86% 


67% 


67% 


64% 


93% 




EP2 + EP3 + EP6. 


100% 


89% 


54% 


59% 


80% 






BPH 


PIN 


BPH 


PIN 




EP8 •* 


increase of m/z S666 


86% 


22% 


88% 


68% 





15 Multivariable analysis of the peak expression data was evaluated to 

determine if the simultaneous over-expression or under-expression of several proteins in 
combination could be used to predict benign versus diseased cell lysates. Logistic 
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regression analysis was performed with the seven most significant differentially 
expressed peaks (4036 Da, 4413 Da, 4639 Da, 4729 Da, 5666 Da, and 28422 (PSA)). 
Table 5 shows a predictive equation based on diseased (PIN/PCA) versus benign 
(N/BPH), resulted in 93.3% specificity and 93.8% sensitivity for PIN or PCA cell lysates. 

5 

Table 5: Classification table based on logistic regression analysis using expression levels 
of seven proteins 4036 Da (EP2), 4361 Da (EP3), 4413 Da (EP4), 4639 (EP5), 4749 Da 
EP6), 5666 Da (EP8), 28422 Da (PSA). 







predicted 








PIN/PCA 


% 




observed 


N 


Y 


correct 


PIN/PCA 


N 


15 


14 


1 


93^ (specificity) 




Y 


16 


1 


15 


93.8 (sensitivity) 












93.5 (overall) 



The development of cancer is a multistep process involving many events 
in the expression of oncogenes and tumor suppressor genes. Single changes in gene 
expression can lead to the differential expression of a myriad of proteins. Analyzing the 

15 multiple protein changes that occur in prostate disease progression is a difficult task, 
made even more daunting by the biological heterogeneity of the disease. Advances in 
time of flight mass spectroscopy, resulting in the SELDI technology, have lead to a 
method for the sensitive and direct analysis and profiling of proteins in complex 
biological samples. Previous studies in this and other laboratories have profiled a limited 

20 number of prostate cell specimens {see e.g. Paweletz, ei aL Drug Development Research 
49: 34^2 (2001), Wright, ei aL, Prostate Cancer and Prostate Diseases 2: 264-267 
(2000), and Paweletz, et aL Urology 57: 160-163 (2001)). In this study we combmed 
SELDI with LCM to produce protein profiles Srom 40 prostate cell lysates derived 6om 
nine prostatectomy specimens. Our results demonstrate that the protein profiles of 

25 normal prostate epithelia, BPH, PIN and PCA display discriminating differences. 

Protein extracts prepared fi-om prostate cell types were analyzed using an 
IMAC3 protein biochip pretreated with CUSO4. An average of 70 protein peaks were 
detected firom the cell lysates using this surface. Overall there were a relatively large 
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niunber of common peaks in the benign and diseased epithelial cell profiles, and very few 
peaks that would, based on presence or absence, be candidate biomarkers for the disease 
progression and/or diagnosis of prostate cancer. It was therefore determined that 
calculating abimdance levels of peaks would enhance the significance of the results and 
5 identify possible expression differences between the samples. Expression levels were 
calculated for diseased cell profiles as compared to expression levels found in matched 
benign cell types. Out of the 70 peaks observed, 14 of these (20%) displayed 
deregulation in the diseased profiles. 

Several small molecular weight peptides or proteins (3000 to 6000 Da) had 

10 increased expression levels in the PIN and PCA cell extracts. The cluster of peaks in this 
range could have originated from proteolysis, or may be cleavage products of larger 
proteins, but were consistent enough in common cell types to be considered part of the 
general profile. However, even cleavage products of proteins may be indicative of 
changes occurring in the prostate disease cycle, as demonstrated in a study of 

15 semenogelin, fix>m seminal plasma {see Merchant, et aL, Electrophoresis, 21: 1 164-1 177 

(2000) ). Furtheraiore, since the IMAC surface has the ability to bind phosphorylated 
pqitides or proteins it is feasible that some of the changes observed are due to differential 
phosphorylation of the proteins or peptides in the diseased cell types. Two peaks 
identified in this study with an average mass 4827 Da ± 26.5 (EP7) (common in the 

20 benign cell types) and 4749 Da ± 26.1 (EP6) (overexpressed in PIN/PCA, see Table 1) 

may represent a de-phosphorylation event occurring in the PIN and PCA cell lysates. The 
average mass shift between these proteins is 78 Da, close to the calculated mass shift of 
79 Da for a 5' de-phosphorylation. Phosphorylation is one of the most important post- 
translational modifications of proteins and enables regulation of many physiological cell 

25 processes (transport, proliferation, differentiation). Examples of aberrant phosphorylation 
of proteins found in cancer studies include ERK 1/ 2 in breast cancer (see Gee, ei aL Int. 
J. Cancer 95: 247-254 (2001)), and androgen receptor in prostate cancer (see Wang, et 
aL, Biochem. Biophys. Res. Commun., 259: 21-28 (1999). 

Alternatively, these small molecules could themselves be intact fimctional 

30 proteins or peptides, examples of which include pro-hormones, growth factors, amidated 
peptides, and defensins. Inarecentstudy by Rocchi era/.. Cancer lies., 61: 1196-1206 

(2001) PC-3 and Dul45 prostate cancer cell lines were found to produce and secrete a 
multifunctional amidated peptide. Inununohistochemical staining was also performed on 
human prostate tissue and showed increased levels of adrenomedullin staining in 
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cancerous epithelia. The authors further demonstrated the activity of the enzyme 
peptidylglycine alpha-amidatin monooxygenase (PAM) in prostate cancer cell lines, 
which produces alpha-amidated bioactive peptides from their inactive glycine-extended 
precursors. C-Terminal amidation is common in peptide hormones, and these peptides 

5 are produced in neuroendocrine cells, known to be present in prostate carcinoma, and 
function as growth factors. 

Previous studies have shown a cytogenetic link between high grade PIN 
and Prostate Cancer strengthening its role as a precursor lesion {see Alcaraz, et aL, 
Prostate, 47: 29-35 (2001)). In addition, greater than 50% of patients with high grade 

10 PIN have cancer detected on a subsequent biopsy (see Park, et al, J. UroL 165: 1409- 

14140 (2001). Therefore, the identification of proteins associated with PIN may be useful 
as markers of early detection. In this study PIN and PCA cell lysates exhibited similar 
protein profiles. Three peaks at 4036 Da (EP2), 4361 Da (EP3), and 4749 Da (EP6), 
showed increased abundance in PIN and PCA lesions when compared to matched benign 

15 cell profiles. Two other peaks at 1 1744 (EPIO), and 28442 Da (free PSA) had decreased 
abundance in BPH, PIN and PCA cell extracts, and one marker at 14696 Da (EPl 1) was 
over-expressed m 56% of the PIN samples and only 29% of matched PCA samples. 
Interestingly, a few of these peaks, based on closely matched molecular weights, were 
found to be present in a matched seminal plasma fix>m one of the patients donating tissue 

20 for this study (data not shown). Since these seminal plasma profiles were generated using 
the IMAC surface pre-treated with CUSO4, they may represent the same protein, however, 
additional matched seminal plasma samples will need to be examined to confirm this 
result. The identification of these peptides or proteins and their use as possible markers 
of early detection is currently under investigation in our laboratory. 

25 The independent evolution of PIN and tumor foci is a conunon feature of 

primary tumors of the prostate, making a panel of biomarkers the most likely solution to 
improvements in early detection and diagnosis. A combination of biomarkers with the 
most significant differential expression was therefore investigated. Since most of the 
markers found in the PCA profiles were also present in the PIN profiles the ability to 

30 discriminate these two cell types from each other was difficult; whereas, better 

discrimination could be achieved between the benign (N, BPH) and the diseased cell 
types (PIN or PCA combined). However, a combination of markers 4361 Da (EP3), and 
4749 Da (EP6) increased the sensitivity to 100% for detection of PIN and PCA with a 
only a slight decrease in specificity. Also of note, were two peaks (4639 Da (EP5)and 
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24184 Da (EP12)) found only in PCA cell lysates. Both of these markers were expressed 
in 43% of tumor samples. No correlation of biomarker expression with grade of tumor 
was observed. Further studies using different capture surfaces for SELDI profiling may 
reveal additional PCA biomarkers. Protein profiles of metastatic prostate samples will be 
required to determme if any of the selected ''biomarkers" represent markers of metastatic 
potential. 

SELDI analyses performed on cell lysates of surrounding adjacent nomial 
epithelia as well as stroma was performed to determine if possible field effects accounted 
for the differences in the protein profiles. These analyses revealed interesting information 
on the milieu of proteins present in the diseased prostate cells. Defensins a-1 and Orl 
were present in both epithelial and stroma cells with increased amounts in BPH, PIN and 
PCA cell lysates and surrounding stroma. Defensins are a family of small peptides with 
antimicrobial^ cytotoxic, and antitumor activities {see Lichtenstein, et al, J. Immunol. 
140: 2686-2694 (1988). Two types of defensins have been characterized. jS-defensins are 
primarily expressed by epithelial cells of the kidneys, skin, and respiratory tract, (see 
Yang, et al. Science, 286: 525-528 (1999), and Zhao, et al, FEBSLett, 396: 319-322 
(1996)), whereas, c^efimsins are expr^sed by neutrophils and Paneth cells {see Ayabe, 
et al, Nat. Immunol, 1:113-118(2000)). Recently ofi-defensins have been found in the 
submandibular glands of patients with oral carcinoma {see Mizukawa, et al. Anticancer 
Res,, 20: 1 125-1 127, (2000), and Mizukawa, et al. Anticancer Res., 19: 2969-2971 
(1999)), and, a study foimd odefensin peptides in the urine of bladder cancer patients 
{see Vlahou, et al. Am. J. Pathol, 158: 1491-1502 (2001)). It is possible that neutrophils 
in the siirrounding area are secreting these defensins into prostate tissue in response to an 
inflammatory or disease state. Unfortunately no tissue firom a normal non-diseased 
prostate was available to use as a control to test this hypothesis. Another peak at 4361 Da 
(EP3), foimd to be over-expressed in the PIN and PCA cell extracts, was also present at 
higher levels m stroma adjacent to the diseased cells, suggestmg a possible field effect for 
this peptide. ' 

Differences were observed for the expression of a 28,400 Da peak, which 
is presumed to be intracellular fi-ee PSA The molecular weigjit identified in our study 
closely matches the observed molecular mass of 28,430 Da for fi*ee PSA determined 
using ion spray mass spectroscopy (ISMS) {see Belanger, et al. Prostate, 27: 187-197 
(1995). This peak was also absent bom matched stroma cell lysates, indicating its 
specific expression in epithelial cells. Furthermore, data obtained finom an immunoassay, 
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also performed using the SELDI platform, Wright, et a/.. Prostate Cancer and Prostate 
Diseases 2: 264-267 (2000), has identified this peak fix>m LCM lysates as PSA based on 
immunoaffinity (data not shown). In this study, normal epithelia of the prostate 
expressed large amoimt of PSA whereas the diseased cell types (PIN and PCA) had 

5 reduced expression levels. As expected, PSA was absent firom stroma cells. Normal 
epithelia microdissected directly adjacent to the tumor foci also expressed high levels of 
PSA. This decrease in intracellular PSA in PCA cells is in agreement with other studies. 
For example, Stege et al. Prostate, 38: 183-188 (1999) found tissue PSA levels lower in 
cancerous than in normal tissue, and a study by Weir et al, J. Urol, 163: 1739-1742, 

10 (2000) found immunohistochemical staining intensity of PSA inversely correlated with 
histological grade of the tumor. Interestingly a recent report by Pawletz et al. Drug 
Development Research 49: 34-42 (2001), also found a 28 kDa protein peak (not identified 
as PSA) via SELDI to be down-regulated in microdissected PCA cells when compared to 
matched normal epithelia. Our result confirms the current opinion that the increase in 

1 5 serum PSA in men with prostate cancer is not due to increased production of PSA by the 
tumor cells, but an increased leakage or secretion of PSA into the circulation by the 
surrounding normal epithelium. 

The BPH protein profiles also displayed some notable differences. There 
was an increase in defensin o^l (3448 Da, EPl) in the BPH lesions, as well as an increase 

20 in abundance of peaks at 4413 Da (EP4), and 5666 Da (EP8). Of special interest was the 
over-expression of a peak at 5666 Da (EP8) found in greater abimdance in 86% of the 
BPH cell lysates tested with a specificity of 88%. Only 22% of the PIN lesions, and none 
of the PCA lesions overexpressed this marker. Efforts are underway to characterize this 
marker. A protein biomarker indicative of BPH alone may, if secreted into serum or 

25 seminal plasma, be useful in the prevention of imnecessary biopsies in patients with 

elevated PSA. Two other high molecular weight proteins at 48308 Da (EP13) and 53830 
Da (EP14) were also found at increased abundance in BPH cell l^ates. 

The present invention provides novel materials and methods for aiding 
30 prostate cancer diagnosis using markers that are differentially present in samples pure 
matched benign (normal or BPH), PIN (high grade), and/or PCA cells. While specific 
examples have been provided, the above description is illustrative and not restrictive. 
Any one or more of the features of the previously described embodiments can be 
combined in any manner with one or more features of any other embodiments in the 
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present invention, Furthennore, many variations of the invention will become apparent to 
those skilled in the art upon review of the specification. The scope of the invention 
should, therefore, be determined not with reference to the above description, but instead 
should be determined with reference to the appended claims along with their full scope of 
equivalents. 

All publications and patent documents cited in this application are incorporated by 
reference in their entirety for all purposes to the same extent as if each individual 
publication or patent document were so individually denoted. By their citation of various 
references in this document. Applicants do not admit any particular reference is '*prior 
art" to their invention. 
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